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PREFACE 

The  matrix  calculus  is  widely  applied  nowadays  in  various  branches  of 
mathematics,  mechanics,  theoretical  physics,  theoretical  electrical  enp^ineer- 
ing,  etc.  However,  neither  in  the  Soviet  nor  the  foreign  literature  is  there  a 
book  that  gives  a  sufficiently  complete  account  of  the  problems  of  matrix 
theory  and  of  its  diverse  applications.  The  present  book  is  an  attempt  to  fill 
this  gap  in  the  mathematical  literature. 

The  book  is  based  on  lecture  courses  on  the  theory  of  matrices  and  its 
applications  that  the  author  has  given  several  times  in  the  course  of  the  last 
seventeen  years  at  the  Universities  of  Moscow  and  Tiflis  and  at  the  Moscow 
Institute  of  Physical  Technology. 

The  book  is  meant  not  only  for  mathematicians  (undergraduates  and 
research  students)  but  also  for  specialists  in  allied  fields  (physics,  engi- 
neering) who  are  interested  in  mathematics  and  its  applications.  Therefore 
the  author  has  endeavoured  to  make  his  account  of  the  material  as  accessible 
as  possible,  assuming  only  that  the  reader  is  acquainted  with  the  theory  of 
determinants  and  with  the  usual  course  of  higher  mathematics  within  the 
programme  of  higher  technical  education.  Only  a  few  isolated  sections  in 
the  last  chapters  of  the  book  require  additional  mathematical  knowledge  on 
the  part  of  the  reader.  Moreover,  the  author  has  tried  to  keep  the  indi- 
vidual chapters  as  far  as  possible  independent  of  each  other.  For  example, 
Chapter  V,  Functions  of  Matrices,  does  not  depend  on  the  material  con- 
tained in  Chapters  II  and  III.  At  those  places  of  Chapter  V  where  funda- 
mental concepts  introduced  in  Chapter  IV  are  being  used  for  the  first  time, 
the  corresponding  references  are  given.  Thus,  a  reader  who  is  acquainted 
with  the  rudiments  of  the  theory  of  matrices  can  immediately  begin  with 
reading  the  chapters  that  interest  him. 

The  book  consists  of  two  parts,  containing  fifteen  chapters. 

In  Chapters  I  and  III,  information  about  matrices  and  linear  operators 
is  developed  ah  initio  and  the  connection  between  operators  and  matrices 
is  introduced. 

Chapter  II  expounds  the  theoretical  basis  of  Gauss's  elimination  method 
and  certain  associated  effective  methods  of  solving  a  system  of  n  linear 
equations,  for  large  n.  In  this  chapter  the  reader  also  becomes  acquainted 
with  the  technique  of  operating  with  matrices  that  are  divided  into  rectan- 
gular 'blocks.' 
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In  Chapter  IV  we  introduce  the  extremely  important  'characteristic' 
and '  minimal '  polynomials  of  a  square  matrix,  and  the  '  adjoint '  and '  reduced 
adjoint'  matrices. 

In  Chapter  V,  which  is  devoted  to  functions  of  matrices,  we  give  the 
general  definition  of  f(A)  as  well  as  concrete  methods  of  computing  it — 
where  f(X)  is  a  function  of  a  scalar  argument  X  and  A  is  a  square  matrix. 
The  concept  of  a  function  of  a  matrix  is  used  in  §§  5  and  6  of  this  chapter 
for  a  complete  investigation  of  the  solutions  of  a  system  of  linear  differen- 
tial equations  of  the  first  order  with  constant  coefficients.  Both  the  concept 
of  a  function  of  a  matrix  and  this  latter  investigation  of  differential  equa- 
tions are  based  entirely  on  the  concept  of  the  minimal  polynomial  of  a  matrix 
and — in  contrast  to  the  usual  exposition — do  not  use  the  so-called  theory  of 
elementary  divisors,  which  is  treated  in  Chapters  VI  and  VII. 

These  five  chapters  constitute  a  first  course  on  matrices  and  their  appli- 
cations. Very  important  problems  in  the  theory  of  matrices  arise  in  con- 
nection with  the  reduction  of  matrices  to  a  normal  form.  This  reduction 
is  carried  out  on  the  basis  of  Weierstrass'  theory  of  elementary  divisors. 
In  view  of  the  importance  of  this  theory  we  give  two  expositions  in  this 
book :  an  analytic  one  in  Chapter  VI  and  a  geometric  one  in  Chapter  VII. 
We  draw  the  reader's  attention  to  §§  7  and  8  of  Chapter  VI,  where  we  study 
effective  methods  of  finding  a  matrix  that  transforms  a  given  matrix  to 
normal  form.  In  §  8  of  Chapter  VII  we  investigate  in  detail  the  method 
of  A.  N.  Krylov  for  the  practical  computation  of  the  coefficients  of  the 
characteristic  polynomial. 

In  Chapter  VIII  certain  types  of  matrix  equations  are  solved.  We  also 
consider  here  the  problem  of  determining  all  the  matrices  that  are  permutable 
with  a  given  matrix  and  we  study  in  detail  the  many-valued  functions  of 
matrices  "*\/3~and  In  A. 

Chapters  IX  and  X  deal  with  the  theory  of  linear  operators  in  a  unitary 
space  and  the  theory  of  quadratic  and  hermitian  forms.  These  chapters  do 
not  depend  on  Weierstrass'  theory  of  elementary  divisors  and  use,  of  the 
preceding  material,  only  the  basic  information  on  matrices  and  linear  opera- 
tors contained  in  the  first  three  chapters  of  the  book.  In  §  9  of  Chapter  X 
we  apply  the  theory  of  forms  to  the  study  of  the  principal  oscillations  of  a 
system  with  n  degrees  of  freedom.  In  §  11  of  this  chapter  we  give  an  account 
of  Frobenius'  deep  results  on  the  theory  of  Hankel  forms.  These  results  are 
used  later,  in  Chapter  XV,  to  study  special  cases  of  the  Routh-Hurwitz 
problem. 

The  last  five  chapters  form  the  second  part  of  the  book  [the  second 
volume,  in  the  present  English  translation].  In  Chapter  XI  we  determine 
normal  forms  for  complex  symmetric,  skew-symmetric,  and  orthogonal  mat- 
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rices  and  establish  interesting  connections  of  these  matrices  with  real  matrices 
of  the  same  classes  and  with  unitary  matrices. 

In  Chapter  XII  we  expound  the  general  theory  of  pencils  of  matrices  of 
the  form  A  +  XB,  where  A  and  B  are  arbitrary  rectangular  matrices  of  the 
same  dimensions.  Just  as  the  study  of  regular  pencils  of  matrices  A  -^  XB 
is  based  on  Weierstrass'  theory  of  elementary  divisors,  so  the  study  of  singu- 
lar pencils  is  built  upon  Kronecker's  theory  of  minimal  indices,  which  is,  as 
it  were,  a  further  development  of  Weierstrass 's  theory.  By  means  of  Kron- 
ecker's theory — the  author  believes  that  he  has  succeeded  in  simplifying  the 
exposition  of  this  theory — we  establish  in  Chapter  XII  canonical  forms  of 
the  pencil  of  matrices  A  +  XB  in  the  most  general  case.  The  results  obtained 
there  are  applied  to  the  study  of  systems  of  linear  differential  equations 
with  constant  coefficients. 

In  Chapter  XIII  we  explain  the  remarkable  spectral  properties  of  mat- 
rices with  non-negative  elements  and  consider  two  important  applications 
of  matrices  of  this  class :  1 )  homogeneous  Markov  chains  in  the  theory  of 
probability  and  2)  oscillatory  properties  of  elastic  vibrations  in  mechanics. 
The  matrix  method  of  studying  homogeneous  Markov  chains  was  developed 
in  the  book  [46]  by  V.  I.  Romanovskii  and  is  based  on  the  fact  that  the  matrix 
of  transition  probabilities  in  a  homogeneous  Markov  chain  with  a  finite 
number  of  states  is  a  matrix  with  non-negative  elements  of  a  special  type 
(a  'stochastic'  matrix). 

The  oscillatory  properties  of  elastic  vibrations  are  connected  with  another 
important  class  of  non-negative  matrices — the  '  oscillation  matrices. '  These 
matrices  and  their  applications  were  studied  by  M.  G.  Krein  jointly  with 
the  author  of  this  book.  In  Chapter  XIII,  only  certain  basic  results  in  this 
domain  are  presented.  The  reader  can  find  a  detailed  account  of  the  whole 
material  in  the  monograph  [17], 

In  Chapter  XIV  we  compile  the  applications  of  the  theory  of  matrices 
to  systems  of  differential  equations  with  variable  coefficients.  The  central 
place  (§§  5-9)  in  this  chapter  belongs  to  the  theory  of  the  multiplicative 
integral  (Produktintegral)  and  its  connection  with  Volterra's  infinitesimal 
calculus.  These  problems  are  almost  entirely  unknown  in  Soviet  mathe- 
matical literature.  In  the  first  sections  and  in  §  11,  we  study  reducible 
systems  (in  the  sense  of  Lyapunov)  in  connection  with  the  problem  of  stabil- 
ity of  motion ;  we  also  give  certain  results  of  N.  P.  Erugin.  Sections  9-11 
refer  to  the  analytic  theory  of  systems  of  differential  equations.  Here  we 
clarify  an  inaccuracy  in  Birkhoff 's  fundamental  theorem,  which  is  usually 
applied  to  the  investigation  of  the  solution  of  a  system  of  differential  equa- 
tions in  the  neighborhood  of  a  singular  point,  and  we  establish  a  canonical 
form  of  the  solution  in  the  case  of  a  regular  singular  point. 
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In  §  12  of  Chapter  XIV  we  give  a  brief  survey  of  some  results  of  the 
fundamental  investigations  of  I.  A.  Lappo-Danilevskii  on  analytic  functions 
of  several  matrices  and  their  applications  to  differential  systems. 

The  last  chapter,  Chapter  XV,  deals  with  the  applications  of  the  theory 
of  quadratic  forms  (in  particular,  of  Hankel  forms)  to  the  Routh-Hurwitz 
problem  of  determining  the  number  of  roots  of  a  polynomial  in  the  right 
half -plane  (Re  0  >  0) .  The  first  sections  of  the  chapter  contain  the  classical 
treatment  of  the  problem.  In  §  5  we  give  the  theorem  of  A.  M.  Lyapunov  in 
which  a  stability  criterion  is  set  up  which  is  equivalent  to  the  Routh-Hurwitz 
criterion.  Together  with  the  stability  criterion  of  Routh-Hurwitz  we  give, 
in  §  11  of  this  chapter,  the  comparatively  little  known  criterion  of  Lienard 
and  Chipart  in  which  the  number  of  determinant  inequalities  is  only  about 
half  of  that  in  the  Routh-Hurwitz  criterion. 

At  the  end  of  Chapter  XV  we  exhibit  the  close  connection  between  stabil- 
ity problems  and  two  remarkable  theorems  of  A.  A.  Markov  and  P.  L. 
Chebyshev,  which  were  obtained  by  these  celebrated  authors  on  the  basis  of  the 
expansion  of  certain  continued  fractions  of  special  types  in  series  of  decreas- 
ing powers  of  the  argument.    Here  we  give  a  matrix  proof  of  these  theorems. 

This,  then,  is  a  brief  summary  of  the  contents  of  this  book. 


F.  R.  Gantmacher 
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CHAPTER   XI 

COMPLEX  SYMMETRIC,  SKEW-SYMMETRIC,  AND 
ORTHOGONAL  MATRICES 

In  Volume  I,  Chapter  IX,  in  connection  with  the  study  of  linear  operators 
in  a  euclidean  space,  we  investigated  real  symmetric,  skew-symmetric,  and 
orthogonal  matrices,  i.e.,  real  square  matrices  characterized  by  the  relationsf- 

ST  =  S,K''  =  —  K,  and  Q''  =  Q-\ 

respectively  (here  Q"""  denotes  the  transpose  of  the  matrix  Q).  We  have 
shown  that  in  the  field  of  complex  numbers  all  these  matrices  have  linear 
elementary  divisors  and  we  have  set  up  normal  forms  for  them,  i.e.,  'simplest' 
real  symmetric,  skew-symmetric,  and  orthogonal  matrices  to  which  arbitrary 
matrices  of  the  tj^es  under  consideration  are  real-similar  and  orthogonally 
similar. 

The  present  chapter  deals  with  the  investigation  of  complex  symmetric, 
skew-symmetric,  and  orthogonal  matrices.  We  shall  clarify  the  question 
of  what  elementary  divisors  these  matrices  can  have  and  shall  set  up  normal 
forms  for  them.  These  forms  have  a  considerably  more  complicated  struc- 
ture than  the  corresponding  normal  forms  in  the  real  case.  As  a  preliminary, 
we  shall  establish  in  the  first  section  interesting  connections  between  com- 
plex orthogonal  and  unitary  matrices  on  the  one  hand,  and  real  symmetric, 
skew-symmetric,  and  orthogonal  matrices  on  the  other  hand. 

§  1.    Some  Formulas  for  Complex  Orthogonal  and  Unitary  Matrices 

1.    We  begin  with  a  lemma : 

Lemma  1  :^  1.  If  a  matrix  O  is  both  hermiticm  and  orthogonal  {G'^=G  = 
G~'^),  then  it  can  he  represented  in  the  form 

0  =  le^^,  (1) 

where  I  is  a  real  symmetric  involutory  matrix  and  K  a  real  skew-symmetric 
matrix  permutable  with  it : 


iSee  [169],  pp.  223-225. 

t  In  this  and  in  the  following  chapters,  a  matrix  denoted  by  the  letter  Q  is  not  neces- 
sarily orthogonal. 
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i=:i=r  ,p=e,k=k=—e:^.  (2) 

2.   If,  in  addition,  G  is  a  positive-definite  hermitian  matrix,^  then  in 
(1)  I  =  E  and 

G  =  e*K,  (3) 

Proof.    Let 

G  =  S  +  iT,  (4) 


where  8  and  T  are  real  matrices.    Then 

§=:S  —  iT  and  G'^  =  8^  +  iT. 


(5) 


Therefore  the  equation  G  =  G'^  implies  that  S  =  8^  and  T  =  —T'^,  i.e.,  S  is 
symmetric  and  T  skew-symmetric. 

Moreover,  when  the  expressions  for  G  and  G  from  (4)  and  (5)  are  sub- 
stituted in  the  complex  equation  GG^E,  it  breaks  up  into  two  real  equations : 


+  T^  =  E  and  ST  =  T8. 


(6) 


The  second  of  these  equations  shows  that  S  and  T  commute. 
By  Theorem  12'  of  Chapter  IX  (Vol.  I,  p.  292),  the  commuting  normal 
matrices  8  and  T  can  be  carried  simultaneously  into  quasi-diagonal  form  by 
a  real  orthogonal  transformation.    Therefore^ 


S  —  Q  [Sii  Sj,  S2i  ^2,  .  .  . ,  Sg,  Sgy  ^2 j+ 1 ,  •  •  •  ,  5„ j  Q~   , 


fll     0    t, 

\\\-h    0 


0    t. 

0    «, 

> 

-h  0 

>  •  •  •  > 

-h  0 

,0 ol 


(e=«=e^-')  (7) 


C-" 


(the  numbers  s*  and  ti  are  real).    Hence 
G=8-\-iT 


\\\—%t^  «i 


52      i^2 


—  It^    82 


-itg   Sg\ 


>  *2j+l»  •••>*nr 


Q-^-  (8) 


On  the  other  hand,  when  we  compare  the  expressions  (7)  for  ;S^  and  T 
with  the  first  of  the  equations  (6),  we  find: 

4-n  =  l,     sl-ti=l,..,,    sl-t^g  =  l,    52,+i=±l,...,    s^=±l.     (9) 


2  I.e.,  G  is  the  coefficient  matrix  of  a  positive-definite  hermitian  form   (see  Vol.  I, 
Chapter  X,  §9). 

3  See  also  the  Note  following  Theorem  12'  of  Vol.  I,  Chapter  IX  (p.  293). 
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Now  it  is  easy  to  verify  that  a  matrix  of  the  type 
can  always  be  represented  in  the  form 


8  it 
—its 


S      It 
—it    8 


=  ee 


■9     0 


where 


1 5 1  =  cosh  q)y     st  =  sinh  9? ,     e  =  sign  «. 


Therefore  we  have  from  (8)  and  (9)  : 


til     ^     ^^\\  i\\     °     ''•II  <|1     ^     ''*l 


with  s2  —  <2  =  1 


G?=0{±ell-^^  °",±c""^'  '\...,±e^^'''^  '",±1....,±1}0-^  (10) 
i.e., 

G  =  Ie*K  . 


where 


/=Q{±i,±i,...,±i}g-S 


0  9), 

1 

0     <P, 

-<Pi    0 

»  •  •  •  > 

-9,    0 

,0,...,0^i 


(11) 


and 


IK  =  KI. 


From  (11)  there  follows  the  equation  (2). 

2.  If,  in  addition,  it  is  known  that  (r  is  a  positive-definite  hermitian 
matrix,  then  we  can  state  that  all  the  characteristic  values  of  G  are  positive 
(see  Volume  I,  Chapter  IX,  p.  270).  But  by  (10)  these  characteristic  values 
are 

±  6^s  ±  e-^s  ±  e^s  ±  e-^«,  ...,  ±  e^9^  ±  e"''?,  ±  1,  . . . ,  ±  1 

(here  the  signs  correspond  to  the  signs  in  (10) ). 

Therefore  in  the  formula  (10)  and  the  first  formula  of  (11),  wherever 
the  sign  dz  occurs,  the  +  sign  must  hold.    Hence 

I  =  Q{1,1,...,1}Q-'  =  E, 

and  this  is  what  we  had  to  prove. 

This  completes  the  proof  of  the  lemma. 
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By  means  of  the  lemma  we  shall  now  prove  the  following  theorem : 

Theorem  1 :    Every  complex  orthogonal  matrix  Q  can  he  represented  in 
the  form 

Q  =  EeiK^  (12) 

where  B  is  a  real  orthogonal  matrix  and  K  a  real  skew-symmetric  matrix 

B  =  B  =  B-^-\    K  =  K  =  —  K'^.  (13) 

Proof.    Suppose  that  (12)  holds.    Then 

and 

Q'^Q^e^^B''Be^^=e^^^. 

By  the  preceding  lemma  the  required  real  skew-symmetric  matrix  K  can 
be  determined  from  the  equation 

Q*Q=e^iK  (14) 

because  the  matrix  Q*Q  is  positive-definite  hermitian  and  orthogonal.  After 
K  has  been  determined  from  (14)  we  can  find  B  from  (12)  : 

B=Qe-^^.  (15) 

Then 

E*B  =  e-»^0*<?e-»^  =  E  ; 

i.e.,  B  is  unitary.  On  the  other  hand,  it  follows  from  (15)  that  B,  as  the 
product  of  two  orthogonal  matrices,  is  itself  orthogonal :  B'^B  =  E.  Thus 
B  is  at  the  same  time  unitary  and  orthogonal,  and  hence  real.  The  formula 
(15)  can  be  written  in  the  form  (12). 

This  proves  the  theorem.^ 

Now  we  establish  the  following  lemma : 

Lemma  2 :    If  a  matrix  D  is  both  symmetric  and  unitary  {D^D'^  =  D~^), 
then  it  can  he  represented  in  the  form 

D  =  ei^,  (16) 

where  S  is  a  real  symmetric  matrix  {S  =  S  =  8'^). 


"*  The  formula  (12),  like  the  polar  decomposition  of  a  complex  matrix  (in  connection 
with  the  formulas  (87),  (88)  on  p.  278  of  Vol.  I)  has  a  close  connection  with  the  important 
Theorem  of  Cartan  which  establishes  a  certain  representation  for  the  automorphisms  of 
the  complex  Lie  groups;  see  [169],  pp.  232-233. 
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Proof.    We  set 

D  =  V-¥iY     {V=V,Y  =  Y).  (17) 

Then 

B^V  —  iV,    D^=U''-\-iV^. 

The  complex  equation  D^D'^  splits  into  the  two  real  equations 

V=TP,     Y=Y'^. 

Thus,  Z7  and  Y  are  real  symmetric  matrices. 
The  equation  BB  =  E  implies : 

V^  +  Y^  =  E,     UY  =  YU.  (18) 

By  the  second  of  these  equations,  U  and  Y  commute.  When  we  apply 
Theorem  12'  (together  with  the  Note)  of  Chapter  IX  (Vol.  I,  pp.  292-3) 
to  them,  we  obtain : 

U  =  Q{si,s^,...,s„}Q-\     V=Q[h,h,...,QQ-^.  (19) 

Here  Sjc  and  tjc  {k^l,  2,  .  .  . ,  n)  are  real  numbers.    Now  the  first  of  the 
equations  (18)  yields: 

Therefore  there  exist  real  numbers  cpjc  (A;  =  1,  2,  .  .  .  ,  n)  such  that 

Sji  =  cos(pj^,     tf,=  8m(p^     {k=l,2,,,.,n). 

Substituting  these  expressions  for  Sk  and  tk  in  (19)  and  using  (17) ,  we  find  : 

B  =  Q  {e*>s  e»>s  . . . ,  e*>"}  Q-^  =  e<«, 
where 

S=Q[(Pi.q>2.-'-,9>n}Q-^'  (20) 

From  (20)  it  follows  that  S  =  8  =  ^. 
This  proves  the  lemma. 
Using  the  lemma  we  shall  now  prove  the  following  theorem : 

Theorem  2 :    Every  unitary  matrix  U  can  he  represented  in  the  form 

V  =  Be^s,  (21) 

where  B  is  a  real  orthogonal  matrix  and  S  a  real  symmetric  matrix 

R  =  B  =  B'^-\  8  =  8  =  ^.  (22) 
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Proof.    From  (21)  it  follows  that 

U^=e^^R''.  (23) 

Multiplying  (21)  and  (23),  we  obtain  from  (22)  : 

By  Lemma  2,  the  real  symmetric  matrix  S  can  be  determined  from  the 
equation 

U^U  =  e^'^  (24) 

because  V^TJ  is  symmetric  and  unitary.  After  8  has  been  determined,  we 
determine  B  by  the  equation 

R^Ue-'^.  (25) 

Then 

R-'  =  e-^^U^,  (26) 

and  so  from  (24),  (25),  and  (26)  it  follows  that 

R^R  =  e-'^V^  C7e-*'«  =  E , 
i.e.,  B  is  orthogonal. 

On  the  other  hand,  by  (25)  2?  is  the  product  of  two  unitary  matrices 
and  is  therefore  itself  unitary.  Since  E  is  both  orthogonal  and  unitary, 
it  is  real.    Formula  (25)  can  be  written  in  the  form  (21). 

This  proves  the  theorem. 

§  2.    Polar  Decomposition  of  a  Complex  Matrix 

We  shall  prove  the  following  theorem : 

Theorem  3:  7/  A=  11  aiu  lli  is  a  non-singular  matrix  with  complex 
elements^  then 

A  =  SQ  (27) 

and 

A  =  Q^Sr,  (28) 

where  S  and  Si  are  complex  symmetric  matrices,  Q  and  Qi  complex  orthogo- 
nal matrices.    Moreover, 

8  =  iJA^  =  f  {AA") ,     S^  =  iTA  ^  /i  {A"  A) , 

where  /(A),  /i(A)  are  polynomials  in  X. 

The  factors  S  and  Q  in  {21)  {Qi  and  Si  in  (28) )  are  permutable  if  and 
only  if  A  and  A7  are  permutable. 
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Proof.  It  is  sufficient  to  establish  (27),  for  when  we  apply  this  decom- 
position to  the  matrix  A"""  and  determine  A  from  the  formula  thus  obtained, 
we  arrive  at  ( 28 ) . 

If  (27)  holds,  then 

A  =  SQ,     A^=Q-^S 
and  therefore 

AA'^  =  8^.  (29) 

Conversely,  since  AA"^  is  non-singular  (\AA'^\  =  |  A  1^  ^^^  0),  the  function 
^JX  is  defined  on  the  spectrum  of  this  matrix^  and  therefore  an  interpola- 
tion polynomial  f{X)  exists  such  that 


}/AA''  =  f{AA'').  ^  (30) 

We  denote  the  symmetric  matrix  (30)  by 

Then  (29)  holds,  and  so  |  ^  |  7^  0.    Determining  Q  from  (27) 

Q=S-^A, 

we  verify  easily  that  it  is  an  orthogonal  matrix.    Thus  (27)  is  established. 
If  the  factors  S  and  Q  in  (27)  are  permutable,  then  the  matrices 

A  =  SQ     and     A''  =  Q-''S 

are  permutable,  since 

AA^=S^     A^'A^Q-^S^Q. 

Conversely,  if  AA"^  =  A'^A,  then 

i.e.,  Q  is  permutable  with  8^  =  AA"^.    But  then  Q  is  also  permutable  with 
the  matrix  S^f{AA'^). 

Thus  the  theorem  is  proved  completely. 

2.    Using  the  polar  decomposition  we  shall  now  prove  the  following  theorem  : 


•^  See  Vol.  I,  Chapter  V,  §  1.  We  choose  a  single- valued  branch  of  the  function  V)^ 
in  a  simply  connected  domain  containing  all  the  characteristic  values  of  AA"^,  but  not  the 
number  0. 
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Theorem  4 :  If  two  complex  symmetric  or  skew-symmetric  or  orthogonal 
matrices  are  similar : 

B^T-'^AT,  (31) 

then  they  are  orthogonally  similar;  i.e.,  there  exists  am  orthogonal  matrix  Q 
such  that 

B  =  Q-^AQ.  (32) 

Proof.  From  the  conditions  of  the  theorem  there  follows  the  existence 
of  a  polynomial  q(X)  such  that 

A''=q{A),     B^  =  q{B).  (33) 

In  the  case  of  symmetric  matrices  this  polynomial  q(X)  is  identically  equal 
to  /I  and,  in  the  case  of  skew-symmetric  matrices,  to —  L  If  A  and  B  are 
orthogonal  matrices,  then  q(X)  is  the  interpolation  polynomial  for  1/X  on 
the  common  spectrum  of  A  and  B. 

Using  (33),  we  conduct  the  proof  of  our  theorem  exactly  as  we  did  the 
proof  of  the  corresponding  Theorem  10  of  Chapter  IX  in  the  real  case 
(Vol.  I,  p.  289).    From  (31)  we  deduce 

q{B)  =  T-^q{A)T 

or  by  (33) 

B^=  T-^A'^T, 
Hence 

B=T^AT''-K 
Comparing  this  equation  with  (31),  we  easily  find : 

TT'^A^^ATT^.  (34) 

Let  us  apply  the  polar  decomposition  to  the  non-singular  matrix  T 

T  =  8Q     {S  =  8^=  f{TT^),    Q^^Q-^). 

Since  by  (34)  the  matrix  TT"^  is  permutable  with  A,  the  matrix 
S  =  f(TT'^)  is  also  permutable  with  A.  Therefore,  when  we  substitute  the 
product  SQ  for  T  in  (31),  we  have 

B  =  Q-^  S-^  ASQ  =  Q-^AQ . 

This  completes  the  proof  of  the  theorem. 


§  3.    Normal  Form  of  Complex  Symmetric  Matrix 


§  3.     The  Normal  Form  of  a  Complex  Symmetric  Matrix 

1.    We  shall  prove  the  following  theorem : 

Theorem  5 :  There  exists  a  complex  symmetric  matrix  with  arbitrary 
preassigned  elementary  divisors.^ 

Proof.  We  consider  the  matrix  H  of  order  n  in  which  the  elements  of 
the  first  superdiagonal  are  1  and  all  the  remaining  elements  are  zero.  We 
shall  show  that  there  exists  a  symmetric  matrix  8  similar  to  H : 

8=THT-K  (35) 

We  shall  look  for  the  transforming  matrix  T  starting  from  the  conditions : 

S  =  THT-^  =  S^=  T^-^H^T''. 
This  equation  can  be  rewritten  as 

VH  =  H^V,  (36) 

where  V  is  the  symmetric  matrix  connected  with  T  by  the  equation  ^ 

T^T  =  —  2iV,  (37) 

Recalling  properties  of  the  matrices  H  and  F  =  H^  (Vol.  I,  pp.  13-14) 
we  find  that  every  solution  V  of  the  matrix  equation  (36)  has  the  following 
form: 

0      ...      0    tto 


0     tto 


a„_i 


(38) 


where  ao,  ai, . . . ,  ««-!  are  arbitrary  complex  numbers. 

Since  it  is  sufficient  for  us  to  find  a  single  transforming  matrix  T,  we 
&etao  =  l,ai  =  ...  =  a^-i  =  0  in  this  formula  and  define  V  by  the  equation^ 


V= 


1 


0     0 


(39) 


6  In  connection  with  the  contents  of  the  present  section  as  well  as  the  two  sections 
that  follow,  §§  4  and  5,  see  [378]. 

'''  To  simplify  the  following  formulas  it  is  convenient  to  introduce  the  factor  —  2i. 
^  The  matrix  V  is  both  symmetric  and  orthogonal. 
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Furthermore,  we  shall  require  the  transforming  matrix  T  to  be  symmetric : 

T=  T''.  (40) 

Then  the  equation  (37)  for  T  can  be  written  as: 

T^  =  —  2iV.  (41) 

We  shall  now  look  for  the  required  matrix  T  in  the  form  of  a  polynomial 
in  Y.    Since  Y^  =  E,  this  can  be  taken  as  a  polynomial  of  the  first  degree : 

From  (41),  taking  into  account  that  V^  =  E,  we  find: 

a2  +  ^2^0,     2ocP  =  —  2i. 
We  can  satisfy  these  relations  by  setting  a  =  1,  /5  =  —  i.    Then 

T=E  —  iV.  (42) 

T  is  a  non-singular  symmetrix  matrix.^    At  the  same  time,  from  (41)  : 


I.e., 


T-^  =  -^{E  +  iV), 
Thus,  a  symmetric  form  8  oi  H  is  determined  by 


(43) 


8  =-  THT-^  =     {E  —  iV)H{E  +  iV),     F = 


0  1 

1  0 


0     0 


(44) 


Since  8  satisfies  the  equation  (36)  and  Y^  —  E,  the  equation  (44)  can 
be  rewritten  as  follows : 

2S=-{H  +  H^)+i  {HV  -  VH) 


0     I 

i  . 


• 

0 

, 

. 

+  i 

• 

1 

•1 

•0 

0 


1       0 

—  1 


0-    —  1- 


(45) 


^  The  fact  that  T  is  non-singular  follows,  in  particular,  from  (41),  because  V  is  non- 
singular. 
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The  formula  (45)  determines  a  symmetric  form  S  of  the  matrix  H. 
In  what  follows,  if  n  is  the  order  oi  H,  H  =  H^"\  then  we  shall  denote  the 
corresponding  matrices  T,  V,  and  S  by  T(">,  y(">  and  S^^^K 
Suppose  that  arbitrary  elementary  divisors  are  given : 

(A-Ai)P',     a-A2)P«,   ...,   a-;iJP«.  (46) 

We  form  the  corresponding  Jordan  matrix 

For  every  matrix  H^'^i^  we  introduce  the  corresponding  symmetric  form 
S^pjK    From 

SiPj)  =  T^vj)  HiPj)  [^(P/)]-!         (/  =  1, 2,  . . . ,  u) 
it  follows  that 

Therefore  setting 


we  have : 


2^  ==  {  y(Pi)^     y(P«)^     _  ^  ^     y(Pu)  \ 


(47) 
(48) 


8=TJT-K 


8  is  a,  symmetric  form  of  /.  §  is  similar  to  J  and  has  the  same  elementary 
divisors  (46)  as  /.    This  proves  the  theorem. 

Corollary  1.  Every  square  complex  matrix  A  =  \\  Oijc  ll"!  is  similar  to  a 
symmetric  matrix. 

Applying  Theorem  4,  we  obtain : 

Corollary  2.  Every  complex  symmetric  matrix  S  =  \\  aijc  \\i  is  orthogo- 
nally similar  to  a  symmetric  matrix  with  the  normal  form  S,  i.e.,  there  exists 
an  orthogonal  matrix  Q  such  that 


8  =  QSQ-^. 


(49) 


The  normal  form  of  a  complex  symmetric  matrix  has  the  quasi-diagonal 
form 

^  =  {AiJ^(Pi)  +  ^(P.),     A2^(p.)  +  ^(p.),   ...,  4^(p«)  +  ^(P«)},  (50) 

where  the  blocks  S^^'^  are  defined  as  follows  (see  (44),  (45))  : 
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=  i-  [£f(p)  +  £f(p)T  ^  ^  (£^(p)  pr(p)  _  7(p)  £f(p))] 


0     1 

1    • 


1     0 

— 1 


(51) 
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1.  We  shall  examine  what  restrictions  the  skew  symmetry  of  a  matrix 
imposes  on  its  elementary  divisors.  In  this  task  we  shall  make  use  of  the 
following  theorem : 

Theorem  6 :    A  skew-symmetric  matrix  always  has  even  rank. 

Proof.  Let  r  be  the  rank  of  the  skew-symmetric  matrix  K.  Then  K  has 
r  linearly  independent  rows,  say  those  numbered  ii,  12,  •  •  • ,  ir '-,  all  the  remain- 
ing rows  are  linear  combinations  of  these  r  rows.  Since  the  columns  of  K 
are  obtained  from  the  corresponding  rows  by  multiplying  the  elements  by 
—  1,  every  column  of  Z  is  a  linear  combination  of  the  columns  numbered 
i\,  ^2,  .  .  . ,  if  Therefore  every  minor  of  order  r  oi  K  can  be  represented  in 
the  form 

Vi  h  •  •  • 

where  a  is  a  constant. 
Hence  it  follows  that 


K 


1  "2 


0. 


But  a  skew-symmetric  determinant  of  odd  order  is  always  zero.    There- 
fore r  is  even,  and  the  theorem  is  proved. 

Theorem  7 :    If  Xois  a  characteristic  value  of  the  skew-symmetric  matrix 
K  with  the  corresponding  elementary  divisors 


(A-^)/.,     (A-A„)/.,  ...,  (A 


KV 


then  —  Xq  is  also  a  characteristic  value  of  K  with  the  same  number  and  the 
same  powers  of  the  corresponding  elementary  divisors  of  K 
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(A+AoKs  (A+AoKs ...,  a+^y^ 

2.  //  zero  is  a  characteristic  value  of  the  skew-symmetric  matrix  7f,^° 
then  in  the  system  of  elementary  divisors  of  K  all  those  of  even  degree  cor- 
responding to  the  characteristic  value  zero  are  repeated  an  even  number 
of  times. 

Proof.  1.  The  transposed  matrix  K'''  has  the  same  elementary  divisors 
as  K.  But  K''' =  —  K,  and  the  elementary  divisors  of  — K  are  obtained 
from  those  of  K  by  replacing  the  characteristic  values  Ai,  A2,  ...  by  —  Xi, 
—  A2,  .  . . .    Hence  the  first  part  of  our  theorem  follows. 

2.  Suppose  that  to  the  characteristic  value  zero  of  K  there  correspond  ^1 
elementary  divisors  of  the  form  A,  62  of  the  form  X^,  etc.  In  general,  we 
denote  by  dp  the  number  of  elementary  divisors  of  the  form  X^  {p  =  l,2,  .  .  .). 
We  shall  show  that  62,  64,  . . .  are  even  numbers. 

The  defect  d  oi  K  is  equal  to  the  number  of  linearly  independent  charac- 
teristic vectors  corresponding  to  the  characteristic  value  zero  or,  what  is  the 
same,  to  the  number  of  elementary  divisors  of  the  form  X,  X^,  X^,  .  .  .  .  There- 
fore 

d  =  d^+d2  +  d^+"'.  (52) 

Since,  by  Theorem  6,  the  rank  of  K  is  even  and  d  =  n  —  r,  d  has  the  same 
parity  as  n.  The  same  statement  can  be  made  about  the  defects  ds,  d^,  . . .  of 
the  matrices  K^,  K^,  . . . ,  because  odd  powers  of  a  skew-symmetric  matrix 
are  themselves  skew-symmetric.  Therefore  all  the  numbers  di  =  d,  dz,  ^5, .  .  . 
have  the  same  parity. 

On  the  other  hand,  when  K  is  raised  to  the  m-th  power,  every  elementary 
divisor  A^  for  p  <  m  splits  into  p  elementary  divisors  (of  the  first  degree) 
and  for  p^m  into  m  elementary  divisors. ^^  Therefore  the  number  of  ele- 
mentary divisors  of  the  matrices  K,  K^,  . . .  that  are  powers  of  X  are  deter- 
mined by  the  formulas^^ 

rf3  =  5l-f2^2  +  3(^3-f^4  +  ..-), 

cig  =^1 -f  2^2 -f  3^3+ 4^4 +  5(^5 +  (56 +  •••),  (53) 

Comparing  (52)  with  (53)  and  bearing  in  mind  that  all  the  numbers 
di  =  d,  dz,  ds, .  • .  are  of  the  same  parity,  we  conclude  easily  that  62,  ^4,  •  •  •  are 
even  numbers. 

This  completes  the  proof  of  the  theorem. 


10  I.e.,  if  1^1  =  0.    For  odd  n  we  always  have  \K\=0. 

11  See  Vol.  I,  Chapter  VI,  Theorem  9,  p.  158. 

^^  These  formulas  were  introduced  (without  reference  to  Theorem  9)  in  Vol.  I,  Chapter 
VI  (see  formulas  (49)  on  p.  155). 
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2.  Theorem  8 :  There  exists  a  skew-symmetric  matrix  with  arbitrary  pre- 
assigned  elementary  divisors  subject  to  the  restrictions  1.,  2.  of  the  pre- 
ceding theorem. 

Proof.     To  begin  with,  we  shall  find  a  skew-symmetric  form  for  the 
quasi-diagonal  matrix  of  order  2p : 

Jif-{Ao^  +  H,-^^-^}  (54) 

having  two  elementary  divisors   (X  —  Xo)^  and  (A^-^)^;  here  E  =  E^^^, 

We  shall  look  for  a  transforming  matrix  T  such  that 

is  skew-symmetric,  i.e.,  such  that  the  following  equation  holds : 

or 

WJt^-{-[jT^rW=0,  (55) 

where  W  is  the  symmetric  matrix  connected  with  T  by  the  equation^^ 

T''T=—2iW.  (56) 

We  dissect  W  into  four  square  blocks  each  of  order  p : 


W21   wj' 


W 

Then  ( 55 )  can  be  written  as  follows : 
(Wn     W^^aoE  +  H  0        > 

W21     ^22/1      0        -XoE-Hj 

When  we  perform  the  indicated  operations  on  the  partitioned  matrices 
on  the  left-hand  side  of  (57),  we  replace  this  equation  by  four  matrix 
equations : 

1.  H''Wu+Wn{^^oE-]-H)=0, 

2.  H^W,,-W,,H=0, 

3.  H^W^i-W2iH  =  0,  ^ 

4.  H^W22+W22{^^oE-\-H)=0. 


13  See  footnote  7  on  p.  9. 
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The  equation  AX  —  XB  =  0,  where  A  and  B  are  square  matrices  without 
common  characteristic  values,  has  only  the  trivial  solution  X  =  0.^^  There- 
fore the  first  and  fourth  of  the  equations  (58)  yield:  Wii  =  W22  =  0}^ 
As  regards  the  second  of  these  equations,  it  can  be  satisfied,  as  we  have  seen 
in  the  proof  of  Theorem  5,  by  setting 


(59) 


0     .     .     .     0 

1 

Wn=V  = 

0     ...      1 

0 

since  (cf.  (36)) 

1     ...     0 

0 

VH-H^V  =  0. 

From  the  symmetry  of  W  and  V  it  follows  that 

W,,=WJ,=  V. 

The  third  equation  is  then  automatically  satisfied. 
Thus, 


\v  o) 


(60) 


But  then,  as  has  become  apparent  on  page  10,  the  equation  (56)  will  be 
satisfied  if  we  set 


y— Jg7(2p)_^7(2p)^ 


(61) 


Then 


T-^=Y  {E^^  ^^  +  *F<2  P)) ,  (62) 

Therefore,  the  required  skew-symmetric  matrix  can  be  found  by  the  f ormula^^ 

Xl?^)  =  I  [^<2  p)  _  .^(2  p)-j  ^(pp)  j.^(2  p)  ^   .^(2  p)^ 


When  we  substitute  for  f^^  and  V^^p^  the  corresponding  partitioned 
matrices  from  (54)  and  (60),  we  find: 

^-t  See  Vol.  I,  Chapter  VIII,  §  1. 

15  For  ?Lo^0  the  equations  1.  and  4.  have  no  solutions  other  than  zero.     For  ;io  =  0 
there  exist  other  solutions,  but  we  choose  the  zero  solution. 

16  Here  we  use  equations  (55)  and  (60).    From  these  it  follows  that 

y(2p)  j(PP)  y(2p)  _  _j(PP)t 
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^\    —"2 


IE-B7      0     \      .rk^E^E         0        \i^     '^ 

\    0    W-uyx    0     -\^b-b)\y   0 

AO     FWAo^  +  if        0        y 
\Y    o]  \      0       —XqE-h) 

~  "2"\-t  (2  AoF  +  HV+  VH)  H^-U 


I.e., 


jt(.PP)  —  ±_ 


0          1  .    . 

0 

0    .    .    . 

.    .    .    i     2Ao 

-1         0    •  . 

• 

• 

.   '      2Xo     i 

. 

•   .        •   .        1 

*  * 

. 

0         ... 

—  1           0 

.2Ap: ' 

i       ...    0 

0     77'\ 

-i    -2^0 

0      —1  . 

...          0 

.   *  -2Ao    -i 

1         0 

• 

' 

' 

. 

* 

' 

, 

• 

—  i   . 

• 

• 

•  —1 

-2Ao    '      -i  ' 

...        0 

0         .    . 

.     *  1            0 

(64) 


(65) 


We  shall  now  construct  a  skew-symmetric  matrix  K^'J^  of  order  q  having 
one  elementary  divisor  A^,  where  q  is  odd.  Obviously,  the  required  skew- 
symmetric  matrix  will  be  similar  to  the  matrix 


J<*>  = 


0     1     0 
0     0     1 


0 


1  0 
0  — 1 
0         0 


(66) 
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In  this  matrix  all  the  elements  outside  the  first  superdiagonal  are  equal  to 
zero,  and  along  the  first  superdiagonal  there  are  at  first  {q  —  1 ) /2  ele- 
ments 1  and  then  (g  —  l)/2  elements  — 1.     Setting 

K^'^  =  TJ^'^T-\  (67) 

we  find  from  the  condition  of  skew-symmetry : 

WiJ^'^  +  J^'^'^Wt,  =  0,  (68) 

where 

T^T=-2iWi.  (69) 

By  direct  verification  we  can  convince  ourselves  that  the  matrix 


jr,=F">= 


0  . 
0 

..01 
..10 

1 

..00 

satisfies  the  condition  (68).    Taking  this  value  for  Wi  we  find  from  (69), 
as  before : 


When  we  perform  the  corresponding  computation,  we  find 


(70) 


(71) 


2^<"  = 


0 

1  .  .  . 

.   .   . 

.    0 

0 

,      , 

..10 

—  1 

0*  . 

•     •     1 

• 

• 

+  i 

• 

• 

• 

• 

—  1 

—  1 

• 

• 

0 

1 

0 

0 

—  1 

.    .    .        0 

(72) 


Suppose  that  arbitrary  elementary  divisors  are  given,  subject  to  the 
conditions  of  Theorem  7 : 
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(A-A//,(A  +  A/^"  0=1,2,  ...,u),  ^ 

)^k  (A;=l,  2,  . . .,  v\  gfi,  ^2.  •  •  •>  g'e  are  odd  numbers)/^  (        (73) 


Then  the  quasi-diagonal  skew-symmetric  matrix 


(74) 


has  the  elementary  divisors  (73). 

This  concludes  the  proof  of  the  theorem. 

Corollary  :  Every  complex  skew-symmetric  matrix  K  is  orthogonally 
similar  to  a  skew-symmetric  matrix  having  the  normal  form  K  determined 
hy  (74),  (65),  and  (72)  ;  i.e.,  there  exists  a  (complex)  orthogonal  matrix  Q 
such  that 


K=  QKQ-\ 


(75) 


Note.     If  iC  is  a  real  skew-symmetric  matrix,  then  it  has  linear  ele- 
mentary divisors  (see  Vol.  I,  Chapter  IX,  §  13). 


X  —  i(pi,  ^+i(pi,  . . .,  A  —  iq)^^,  X  +  i(p^,  A,  . . .,  A    {cpj  are  real  numbers). 


V  times 


In  this  case,  setting  all  the  Pj  =  l  and  all  the  g&  =  l  in  (74),  we  obtain  as 
the  normal  form  of  a  real  skew-symmetric  matrix 


K-^ 


0    9.1 
-9^1  0 


0    9^. 


,  0, 


«i 
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1.    Let  us  begin  by  examining  what  restrictions  the  orthogonality  of  a 
matrix  imposes  on  its  elementary  divisors. 

Theorem  9  :    1.    //  Ao  (A?  ^^  1)  ^s  a  characteristic  value  of  an  orthogonal 
matrix  Q  and  if  the  elementary  divisors 

(A-Ao)/s(;i-Ao)/.,  ...,  (A-Ao)J^' 


1^  Some  of  the  numbers  Xi,  ?i2,  .  •  . ,  )^u  may  be  zero.     Moreover,  one  of  the  numbers  (/ 
and  V  may  be  zero;  i.e.,  in  some  cases  there  may  be  elementary  divisors  of  only  one  type. 
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correspond  to  this  characteristic  value,  then  1/Ao  is  also  a  characteristic  value 
of  Q  and  it  has  the  same  corresponding  elementary  divisors : 

{X  -  Ai-i)/.,  {X  -  X-')r',  . . .,  (A  -  X-^'. 

2.  If  Xo=  ±  1  is  a  characteristic  value  of  the  orthogonal  matrix  Q,  then 
the  elementary  divisors  of  even  degree  corresponding  to  lo  are  repeated  an 
even  number  of  times. 

Proof.  1.  For  every  non-singular  matrix  Q  on  passing  from  Q  to  Q~^ 
each  elementary  divisor  (A  —  Aq)^  is  replaced  by  the  elementary  divisor 
{X  —  X~^'^)t.^^  On  the  other  hand,  the  matrices  Q  and  Q"""  always  have  the 
same  elementary  divisors.  Therefore  the  first  part  of  our  theorem  follows 
at  once  from  the  orthogonality  condition  Q"^  =  Q~^. 

2.  Let  us  assume  that  the  number  1  is  a  characteristic  value  of  Q,  while 
—  li8noti\E  —  Q\=0,\E  +  Q\y^0).  Then  we  apply  Cayley's  formulas 
(see  Vol.  I,  Chapter  IX,  §  14),  which  remain  valid  for  complex  matrices. 
We  define  a  matrix  K  by  the  equation 

K  =  {E-Q){E  +  Q)-K  (76) 

Direct  verification  shows  that  K"^  =  —  K,  so  that  K  is  skew-symmetric. 
When  we  solve  the  equation  (76)  for  Q,  we  find:^^ 

Q  =  {E-K){E+K)-K 

I X  2 

Setting  fW  =  j^,  we  have  f(A)  =—  y^  0.    Therefore  in  the  transi- 

tion from  K  to  Q  =  f{K)  the  elementary  divisors  do  not  split.^^  Hence  in 
the  system  of  elementary  divisors  of  Q  those  of  the  form  {X  —  1)^^  are  re- 
peated an  even  number  of  times,  because  this  holds  for  the  elementary 
divisors  of  the  form  X^^  of  K  (see  Theorem  7). 

The  case  where  Q  has  the  characteristic  value  —  1,  but  not  +  1,  is  reduced 
to  the  preceding  case  by  considering  the  orthogonal  matrix  —  Q. 

We  now  proceed  to  the  most  complicated  case,  where  Q  has  both  the 
characteristic  value  +  1  and  — 1.  We  denote  by  \p{X)  the  minimal  poly- 
nomial of  Q.  Using  the  first  part  of  the  theorem,  which  has  already  been 
proved,  we  can  write  \p{X)  in  the  form 


18  See  Vol.  I,  Chapter  VI,  §7.  Setting  f{l)=l/X  we  have  f  {X)  =—l/\^  ^0. 
Hence  it  follows  that  in  the  transition  from  Q  to  Q~'  the  elementary  divisors  do  not  split 
(see  Vol.  I,  p.  158). 

19  Note  that  (76)  implies  that  E  -\- K  =  2{E  -^  QY^  and  therefore 

I  i;  +  ^  I  r=  2«  I  ^  +  Q  I"  v^  0. 

20  See  Vol.  I,  p.  158. 
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rp  W  =  (A  -  ir  (A  +  If  n(X-  Xp  (A  -  Xj'ri  (AJ#  1 ;  ;•  =  1, 2,  . . .  ,  «). 

7=1 

We  consider  the  polynomial  g{X)  of  degree  less  than  m  (m  is  the  degree 
of  ip(^))  for  which  g{l)  =1  and  all  the  remaining  m  —  1  values  on  the 
spectrum  of  Q  are  zero ;  and  we  set  :^^ 

P=9{Q)-  (77) 

Note  that  the  functions  (gr(A)  )^  and  ^(1/A)  assume  on  the  spectrum  of  Q 
the  same  values  as  ^( A).    Therefore 

P^  =  P,  P^=g{Q-)  =  g{Q-^)=P,  (78) 

i.e.,  P  is  a  symmetric  projective  matrix.^^ 

We  define  a  polynomial  /i(A)  and  a  matrix  N  by  the  equations 

ha)  =  {X-l)g{kl  (79) 

N=h{Q)={Q-E)P.  (80) 

Since  (/i(i^))*"^  vanishes  on  the  spectrum  of  Q,  it  is  divisible  by  7/;(2) 
without  remainder.    Hence : 

i.e.,  iV^  is  a  nilpotent  matrix  with  mi  as  index  of  nilpotency. 
From  (80)  we  find  r^^ 

N^=(Q'-E)P.  (81) 


21  From  the  fundanieiital  formula  (see  Vol.  I,  p.  104) 

gf<^)  =  2^  [.g  kh)  Zki  +  g'  {h)  Zi2  +  •  •  •] 

it  follows  that 

-2  A  hermitian  operator  P  is  called  projective  if  P"  ^=  P.  In  accordance  with  this, 
a  hermitian  matrix  P  for  Avhich  P- =:  P  is  called  projective.  An  example  of  a  projective 
operator  P  in  a  unitary  space  R  is  the  operator  of  the  orthogonal  projection  of  a  vector 
X  eR  into  a  subspace  5  =  PR,  i.e.,  Px  =  Xg,  Avhere  ^5^5  and  (x  —  Xg)  ±8  (see  Vol.  I, 
p.  248). 

23  All  the  matrices  that  occur  here,  P,  N,  N"^,  Q'^  =  Q-'^,  are  permutable  among  each 
other  and  with  Q,  since  they  are  all  functions  of  Q. 
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Let  us  consider  the  matrix 

R  =  N(N^  +  2E).  (82) 

From  (78),  (80),  and  (81)  it  follows  that 

R=  NN^  +  2N={Q-Q'')  P, 

From  this  representation  of  R  it  is  clear  that  R  is  skew-symmetric. 
On  the  other  hand,  from  (82) 

R^=N^{N^  +  2Ef     (Aj=1,2,  ...).  (83) 

But  N^,  like  N,  is  nilpotent,  and  therefore 

|i^"'  +  2^|v^0. 

Hence  it  follows  from  (83)  that  the  matrices  R^  and  N^  have  the  same  rank 
for  every  k. 

Now  for  odd  k  the  matrix  R^  is  skew-symmetric  and  therefore  (see  p.  12) 
has  even  rank.    Therefore  each  of  the  matrices 

jV,ivr3,i\r6, ... 

has  odd  rank. 

By  repeating  verbatim  for  iV^  the  arguments  that  were  used  on  p.  13 
for  K  we  may  therefore  state  that  among  the  elementary  divisors  of  N  those 
of  the  form  X^^  are  repeated  an  even  number  of  times.  But  to  each  ele- 
mentary divisor  A^^  of  N  there  corresponds  an  elementary  divisor  (A  —  1)^^ 
of  Q,  and  vice  versa.^*  Hence  it  follows  that  among  the  elementary  divisors 
of  Q  those  of  the  form  {I  —  1)^^  are  repeated  an  even  number  of  times. 

We  obtain  a  similar  statement  for  the  elementary  divisors  of  the  form 
(A  +  1)2/'  by  applying  what  has  just  been  proved  to  the  matrix  —  Q. 

Thus,  the  proof  of  the  theorem  is  complete. 

2.    We  shall  now  prove  the  converse  theorem. 


24  Since  /i(l)  =0,  /i'(l)  v-^  0,  in  passing  from  Q  to  N=zh(Q)  the  elementary  divisors 
of  the  form  (X —  1)^^  of  Q  do  not  split  and  are  therefore  replaced  by  elementary  divisors 
VP  (see  Vol.  I,  Chapter  VI,  §  7). 
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Theorem  10 :    Every  system  of  powers  of  the  form 


a-l)^(A-l)^...,  (A-ir, 

(X  +  l)\  {X  +  1)S  . . .,  (A  +  1)''^ 

(9'i>  •  •  '>  Qv>  hi  •  •  -i^w  ^'"'^  odd  numbers) 


(84) 


is  the  system  of  elementary  divisors  of  some  complex  orthogonal  matrix  Q}^ 

Proof.     We  denote  by  fXj  the  numbers  connected  with  the  numbers  Xj 
{3=^1,2,  ...  ,n)  hj  the  equations 

A;  =  e^7   (/=1,2,  ...,w) 

We  now  introduce  the  *  canonical'  skew-symmetric  matrices  (see  the  pre- 
ceding section) 

^(^P  (•  ^  1,  2,  . . .,  u)\  K(^^\  . . .,  K(^^^;  K('^\  ..,,  K^'-\ 
with  the  elementary  divisors 

{X-fi^fi,  (A  +  ///^-  (;  =  1,  2,  . . . ,  u)  A^s  . . . ,  A^-  AS  . . . ,  A'-. 
If  iC  is  a  skew-symmetric  matrix,  then 

is  orthogonal  ( Q^  =  e^^  =  e~^  =  Q~^ ) .    Moreover,  to  each  elementary  divi- 
sor (A  —  fx)^  oiK  there  corresponds  an  elementary  divisor  (A  —  6^)^  of  Q.^^ 
Therefore  the  quasi-diagonal  matrix 

I    K^PiPi^  viVuPu)         Ail)  AQv)  Ah)  eS^v;)  ^ 

Q  =  {e  f^^    ,...,e^«      ;  e       ,  . . .,  e        ;  -e       ,...,-e        }      (85) 

is  orthogonal  and  has  the  elementary  divisors  (84). 
This  proves  the  theorem. 
From  Theorems  4,  9,  and  10  we  obtain : 


2^  Some  (or  even  all)  of  the  numbers  "kj  may  be  ±  1.  One  or  two  of  the  numbers 
u,  V,  w  may  be  zero.  Then  the  elementary  divisors  of  the  corresponding  type  are  absent 
in  Q. 

26  This  follows  from  the  fact  that  for  /(?i)  =:  e*  we  have  /'(X)  —  e^^^  0  for  every  L 
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Corollary:  Every  {complex)  orthogonal  matrix  Q  is  orthogonally 
similar  to  an  orthogonal  matrix  having  the  normal  form  Q;  i.e.,  there  exists 
an  orthogonal  matrix  Qi  such  that 

Q=Q,QQ7'-  (86) 

Note.  Just  as  we  have  given  a  concrete  form  to  the  diagonal  blocks  in  the 
skew-symmetric  matrix  K,  so  we  could  for  the  normal  form  Q.^'' 


See  [378]. 
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SINGULAR  PENCILS  OF  MATRICES 
§  1.    Introduction 

1.    The  present  chapter  deals  with  the  following  problem: 

Given  four  matrices  A,  B,  Ai,  Bi  all  of  dimension  m  X  n  with  elements 
from  a  number  field  f,  it  is  required  to  find  under  what  conditions  there 
exist  two  square  non-singular  matrices  P  and  Q  of  orders  m  and  n,  respec- 
tively, such  that^ 

PAQ=Ai,     PBQ  =  B^  (!) 

By  introduction  of  the  pencils  of  matrices  A-\-  IB  and  A^  +  /-Bi  the 
two  matrix  equations  (1)  can  be  replaced  by  the  single  equation 

P{A+XB)Q=Ay  +  XB^  (2) 

Definition  1 :  Two  pencils  of  rectangular  matrices  A  +  XB  and  Ax  +  XB^ 
of  the  same  dimensions  m  X  n  connected  by  the  equation  (2)  in  which  P  and 
Q  are  constant  square  non-singular  matrices  {i.e.,  matrices  independent  of 
X)  of  orders  m  and  n,  respectively,  will  be  called  strictly  equivalent."^ 

According  to  the  general  definition  of  equivalence  of  A-matrices  (see 
Vol.  I,  Chapter  Yl,  p.  132),  the  pencils  A  +  AB  and  A^  +  XBi  are  equivalent 
if  an  equation  of  the  form  (2)  holds  in  which  P  and  Q  are  two  square 
A-matrices  with  constant  non-vanishing  determinants.  For  strict  equivalence 
it  is  required  in  addition  that  P  and  Q  do  not  depend  on  X.^ 

A  criterion  for  equivalence  of  the  pencils  A  +  XB  and  Ai  +  XBi  follows 
from  the  general  criterion  for  equivalence  of  A-matrices  and  consists  in  the 
equality  of  the  invariant  polynomials  or,  what  is  the  same,  of  the  elementary 
divisors  of  the  pencils  A -\-  XB  and  Ai  +  XBi  (see  Vol.  I,  Chapter  VI,  p.  141) . 


^  If  such  matrices  P  and  Q  exist,  then  their  elements  can  be  taken  from  the  field  f. 
This  follows  from  the  fact  that  the  equations  (1)  can  be  written  in  the  form  PA  =  AiQ-^, 
PB  =  BiQ~^  and  are  therefore  equivalent  to  a  certain  system  of  linear  homogeneous  equa- 
tions for  the  elements  of  P  and  Q~^  with  coefficients  in  f. 

2  See  Vol.  I,  Chapter  VI,  p.  145. 

3  We  have  replaced  the  term  '  equivalent  pencils '  that  occurs  in  the  literature  by 
'strictly  equivalent  pencils,'  in  order  to  draw  a  sharp  distinction  between  Definition  1  and 
the  definition  of  equivalence  in  Vol.  I,  Chapter  VI. 

24 
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In  this  chapter,  we  shall  establish  a  criterion  for  strict  equivalence  of 
two  pencils  of  matrices  and  we  shall  determine  for  each  pencil  a  strictly 
equivalent  canonical  form. 

2.  The  task  we  have  set  ourselves  has  a  natural  geometrical  interpretation. 
We  consider  a  pencil  of  linear  operators  A  +  XB  mapping  R„  into  Rm-  For 
a  definite  choice  of  bases  in  these  spaces  the  pencil  of  operators  A  +  XB  cor- 
responds to  a  pencil  of  rectangular  matrices  A  +  XB  (of  dimension  m  X  w)  ; 
under  a  change  of  bases  in  R^  and  Km  the  pencil  A  +  XB  is  replaced  by  a 
strictly  equivalent  pencil  P{A-\-  XB)Q,  where  P  and  Q  are  square  non- 
singular  matrices  of  order  m  and  n  (see  Vol.  I,  Chapter  III,  §§  2  and  4). 
Thus,  a  criterion  for  strict  equivalence  gives  a  characterization  of  that  class 
of  matrix  pencils  A-\-  XB  (of  dimension  m  X  ti)  which  describe  one  and  the 
same  pencil  of  operators  A-\-  XB  mapping  R^  into  Rm  for  various  choices  of 
bases  in  these  spaces. 

In  order  to  obtain  a  canonical  form  for  a  pencil  it  is  necessary  to  find 
bases  for  R„  and  Rm  in  which  the  pencil  of  operators  A  ■]-  XB  i^  described  by 
matrices  of  the  simplest  possible  form. 

Since  a  pencil  of  operators  is  given  by  two  operators  A  and  B,  we  can 
also  say  that :  The  present  chapter  deals  with  the  simultaneous  investigation 
of  two  operators  A  and  B  mapping  Rn  into  Rm. 

3.  All  the  pencils  of  matrices  A  +  XB  of  dimension  m  X  n  fall  into  two 
basic  types :  regular  and  singular  pencils. 

Definition  2 :    A  pencil  of  matrices  A  +  XB  is  called  regular  if 

1)  A  and  B  are  square  matrices  of  the  same  order  n;  and 

2)  The  determinant  \  A  +  XB  \  does  not  vanish  identically. 

In  all  other  cases  (m^n,  or  m  =  n  but  \  A  +  XB  \^0),  the  pencil  is  called 
singular. 

A  criterion  for  strict  equivalence  of  regular  pencils  of  matrices  and  also 
a  canonical  form  for  such  pencils  were  established  by  Weierstrass  in  1867 
[377]  on  the  basis  of  his  theory  of  elementary  divisors,  which  we  have  ex- 
pounded in  Chapters  VI  and  VII.  The  analogous  problems  for  singular 
pencils  were  solved  later,  in  1890,  by  the  investigations  of  Kronecker  [249]  ."* 
Kronecker's  results  form  the  primary  content  of  this  chapter. 

§  2.    Regular  Pencils  of  Matrices 

1.  We  consider  the  special  case  where  the  pencils  A  +  XB  and  Ai  +  XBi 
consist  of  square  matrices  (m  =  n)  \  B  \y^0,  \  Bi\  y^O.  In  this  case,  as  we 
have  shown  in  Chapter  VI  (Vol.  I,  pp.  145-146),  the  two  concepts  of  'equiv- 


^  Of  more  recent  papers  dealing  with  singular  pencils  of  matrices  we  mention   [234], 
[369],  and  [255]. 
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alence '  and  *  strict  equivalence '  of  pencils  coincide.  Therefore,  by  applying 
to  the  pencils  the  general  criterion  for  equivalence  of  A-matrices  (Vol.  I, 
p.  141 )  we  are  led  to  the  following  theorem : 

Theorem  1 :  Two  pencils  of  square  matrices  of  the  same  order  A-\-  XB 
and  Ai  +  ABi  for  which  \B  \y^0  and  \  Bi\=^0  are  strictly  equivalent  if  and 
only  if  the  pencils  have  the  same  elementary  divisors  in  f. 

A  pencil  of  square  matrices  A-}-  XB  with  |  B  |  ^^  0  was  called  regular  in 
Chapter  VI,  because  it  represents  a  special  case  of  a  regular  matrix  poly- 
nomial in  X  (see  Vol.  I,  Chapter  IV,  p.  76).  In  the  preceding  section  of  this 
chapter  we  have  given  a  wider  definition  of  regularity.  According  to  this 
definition  it  is  quite  possible  in  a  regular  pencil  to  have  |  5  |  =0  (and  even 
\A\  =  \B\=0). 

In  order  to  find  out  whether  Theorem  1  remains  valid  for  regular  pencils 
(with  the  extended  Definition  1),  we  consider  the  following  example : 


A+^B 


2  1  3 

3  2  5 
3  2  6 


+  A 


1  1  2 
1  1  2 
1   1  3 


Ai+?.Bi= 


2  1   1 

1   1   1 

1   2   1 

+  A 

1  1   1 

1   1   1 

1  1   1 

(3) 


It  is  easy  to  see  that  here  each  of  the  pencils  A-\-  XB  and  Ai  +  XBi  has 
only  one  elementary  divisor,  X-\- 1.  However,  the  pencils  are  not  strictly 
equivalent,  since  the  matrices  B  and  Bi  are  of  ranks  2  and  1,  respectively ; 
whereas  if  an  equation  (2)  were  to  hold,  it  would  follow  from  it  that  the 
ranks  of  B  and  Bi  are  equal.  Nevertheless,  the  pencils  (3)  are  regular 
according  to  Definition  1,  since 

This  example  shows  that  Theorem  1  is  not  true  with  the  extended  defini- 
tion of  regularity  of  a  pencil. 

2.  In  order  to  preserve  Theorem  1,  we  have  to  introduce  the  concept  of  '  in- 
finite '  elementary  divisors  of  a  pencil.  We  shall  give  the  pencil  A  +  XB  in 
terms  of  'homogeneous'  parameters  X,  ju:  juA  +  XB.  Then  the  determinant 
A{X,  ju)  ^  I  //A  +  XB  \  is  a  homogeneous  function  of  X,  ju.  By  determining 
the  greatest  common  divisor  Djc{X,  ju)  of  all  the  minors  of  order  k  of  the 
matrix  juA  +  XB  (A;  =  1,  2,  .  .  . ,  n),  we  obtain  the  invariant  polynomials  by 
the  well  known  formulas 


Dn{X,fi) 


I>n-l  (A,  i") 


here  all  the  DjdX,  ju)  and  ij(X,  ju)  are  homogeneous  polynomials  in  X  and  jn. 
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Splitting  the  invariant  polynomials  into  powers  of  homogeneous  polynomials 
irreducible  over  f,  we  obtain  the  elementary  divisors  ea{^,  f^)  (a  =  1,  2,  .  .  .) 
of  the  pencil  fiA  +  kB  in  f. 

It  is  quite  obvious  that  if  we  set  //  =  1  in  ea{^,  fi)  we  are  back  to  the  ele- 
mentary divisors  ea(^)  of  the  pencil  A  +  XB.  Conversely,  from  each  ele- 
mentary divisor  eaW  of  degree  q  we  obtain  the  correspondingly  elementary 
divisor  ea(>^,  i")  by  the  formula  ea  (^,//)  =>*ea(--)  •  We  can  obtain  in  this  way 
all  the  elementary  divisors  of  the  pencil  fiA  +  IB  apart  from  those  of  the 
form  ^^. 

Elementary  divisors  of  the  form  ii^  exist  if  and  only  \i\B\=0  and  are 
called  'infinite'  elementary  divisors  of  the  pencil  A  +  IB. 

Since  strict  equivalence  of  the  pencils  A  +  XB  and  Ai  +  XBi  implies 
strict  equivalence  of  the  pencils  ^lA  +  XB  and  ^lAi  +  XBi,  we  see  that  for 
strictly  equivalent  pencils  A  +  XB  and  Ai  H-  XBi  not  only  their  'finite,'  but 
also  their  'infinite'  elementary  divisors  must  coincide. 

Suppose  now  that  A-h  XB  and  Ai  +  XBi  are  two  regular  pencils  for 
which  all  the  elementary  divisors  coincide  (including  the  infinite  ones). 
We  introduce  homogeneous  parameters :  /nA  +  XB,  /nAi  +  XBi.  Let  us  now 
transform  the  parameters 

In  the  new  parameters  the  pencils  are  written  as  follows : 

fiA-\-XB,  fill  +  IBj  ,  where  B  =  ^^A  +  a^B,  Bi  =  ^lAi  +  aiBi. 

From  the  regularity  of  the  pencils  juA  +  XB  and  jnAi  +  XBi  it  follows  that  we 
can  choose  the  numbers  ai  and  /?i  such  that  |  S  |  ^^  0  and  \  Bi\^0. 

Therefore  by  Theorem  1  the  pencils  jUA  -f  XB'  and  jlA^  +  ISj  and  con- 
sequently the  original  pencils  juA  +  XB  and  fiAi  +  XBi  (or,  what  is  the  same, 
A  +  XB  and  Ai  +  XBi)  are  strictly  equivalent.  Thus,  we  have  arrived  at 
the  following  generalization  of  Theorem  1 : 

Theorem  2:  Two  regular  pencils  A+J,B  and  Ai  +  XBi  are  strictly 
equivalent  if  and  only  if  they  have  the  same  {^finite'  and  'infinite')  ele- 
mentary divisors. 

In  our  example  above  the  pencils  (3)  had  the  same  'finite'  elementary 
divisor  X  +  1,  but  different  'infinite'  elementary  divisors  (the  first  pencil 
has  one ' infinite'  elementary  divisor  jj.^  ;  the  second  has  two  :  fi,  fi).  Therefore 
these  pencils  turn  out  to  be  not  strictly  equivalent. 
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3.  Suppose  now  that  A  +  }.B  is  an  arbitrary  regular  pencil.  Then  there 
exists  a  number  c  such  that  \  A  +  cB  \y^0.  We  represent  the  given  pencil 
in  the  form  Ai+  (>? — c)B,  where  Ai  =  A-\-  cB,  so  that  |  Ai  |  =7^  0.  We 
multiply  the  pencil  on  the  left  by  Aj^ :  E  +  {k  —  c)Aj^B.  By  a  similarity 
transformation  we  put  the  pencil  in  the  form^ 

E  +  (A-c)  {J^,J^]  =  {E-cJo  +  AJo,  E-cJ^  +  AJi) ,  (4) 

where  {Jo,  Ji}  is  the  quasi-diagonal  normal  form  of  Ai^B,  Jo  is  a  nilpotent 
Jordan  matrix,^  and  |  Ji  |  =7^  0. 

We  multiply  the  first  diagonal  block  on  the  right-hand  side  of  (4)  by 
(E  —  cJo)~^  and  obtain:  E  +  X{E  —  cJ'o)~Vo.  Here  the  coefficient  of  X 
is  a  nilpotent  matrix.^  Therefore  by  a  similarity  transformation  we  can 
put  this  pencil  into  the  form^ 

e  +  j[jq  =  {  i\r("i),  i^(««>, . . .,  ivr<«'> }  (i^<«>  =  JSJ<«>  +  A£f<«>) .  (5) 

We  multiply  the  second  diagonal  block  on  the  right-hand  side  of  (4)  by 
Ji^ ;  it  can  then  be  put  into  the  form  J  +  XE  by  a  similarity  transformation, 
where  J  is  a  matrix  of  normal  form®  and  E  the  unit  matrix.  We  have  thus 
arrived  at  the  following  theorem : 

Theorem  3 :  Every  regular  pencil  A  +  ?.B  can  he  reduced  to  a  (strictly 
equivalent)  canonical  quasi-diagonal  form 

[N(u,)^  iv^(«.), . . . ,  m^^\  J  +  ?lE}  (iv^(«) = je;(")  +  ^£r(«)) ,  (6) 

where  the  first  s  diagonal  Mocks  correspond  to  infinite  elementary  divisors 
fjL^\  //"%  .  . . ,  ju'^'of  the  pencil  A  +  XB  and  where  the  normal  form  of  the  last 
diagonal  Mock  J  +  XE  is  uniquely  determined  hy  the  finite  elementary 
divisors  of  the  given  pencil. 


5  The  unit  matrices  E  in  the  diagonal  blocks  on  the  right-hand  side  of  (4)  have  the 
same  order  as  Jo  and  Ji. 

^  I.e.,  J  J  =  0  for  some  integer  I  >  0. 

7  From  Jo^  =0  it  follows  that  [(E  —  cJo^Jo]^  =  0. 

8  Here  JE(")  is  a  unit  matrix  of  order  u  and  HM  is  a  matrix  of  order  u  whose  elements 
in  the  first  superdiagonal  are  1,  while  the  remaining  elements  are  zero. 

9  Since  the  matrix  J  can  be  replaced  here  by  an  arbitrary  similar  matrix,  we  may 
assume  that  J  has  one  of  the  normal  forms  (for  example,  the  natural  form  of  the  first 
or  second  kind  or  the  Jordan  form  (see  Vol.  I,  Chapter  VI,  §  6)). 
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§  3.     Singular  Pencils.      The  Reduction  Theorem 

1.  We  now  proceed  to  consider  a  singular  pencil  of  matrices  A  +  ?.B  of 
dimension  m  X  n.  We  denote  by  r  the  rank  of  the  pencil,  i.e.,  the  largest 
of  the  orders  of  minors  that  do  not  vanish  identically.  From  the  singu- 
larity of  the  pencil  it  follows  that  at  least  one  of  the  inequalities  r  <  n  and 
r  <i'm  holds,  say  r  <  n.  Then  the  columns  of  the  A-matrix  A  -\-  XB  are 
linearly  dependent,  i.e.,  the  equation 

{A+2.B)x  =  o,  (7) 

where  x  is  an  unknown  column  matrix,  has  a  non-zero  solution.  Every 
non-zero  solution  of  this  equation  determines  some  dependence  among  the 
columns  oi  A  +  XB.  We  restrict  ourselves  to  only  such  solutions  a; ( A )  of  (7) 
as  are  polynomials  in  A,^°  and  among  these  solutions  we  choose  one  of  least 
possible  degree  e : 

X  (A)  =Xo-^Xj^  +  A2a;2 +  (-  m^Xe  (Xe^O).  (8) 

Substituting  this  solution  in  (7)  and  equating  to  zero  the  coefficients  of 
the  powers  of  X,  we  obtain : 

Axq^o,  Bxq  —  Axi=o,  Bxi  —  Ax2=o,  ...  ,  Bxe-i  —  Axe  =  o,  Bxe  =  o.     (9) 

Considering  this  as  a  system  of  linear  homogeneous  equations  for  the 
elements  of  the  columns  Xq,  — Xi,  +  X2  . . . ,  ( —  l)*iCe„  we  deduce  that  the 
coefficient  matrix  of  the  system 

e+l 


Me=MeiA  +  m=i     ^        ^        •  .        I  (10) 


is  of  rank  Qe  <  (e  +  l)n.    At  the  same  time,  by  the  minimal  property  of  e, 
the  ranks  Qo,Qu  . . .  y  Qs—i  of  the  matrices 


'A    0 

.  .  .  0 

B    A 

. 

0    b' 

• 

• 

•  .  A 

0   6  . 

.  .   B 

10  For  the  actual  determination  of  the  elements  of  the  column  x  satisfying  (7)  it  is 
convenient  to  solve  a  system  of  linear  homogeneous  equations  in  which  the  coefficients  of 
the  unknown  depend  linearly  on  I.  The  fundamental  linearly  independent  solutions  x  can 
always  be  chosen  such  that  their  elements  are  polynomials  in  X. 
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Mo 


Me- 


A 

0  . 

.  .  0' 

B 

A 

• 

•     A 

.0 

. 

.   B 

(100 


satisfy  the  equations  Qq=  n,  Qi=^2n,  .  .  ,,  Qe__i  ^en. 

Thus :  The  number  e  is  the  least  value  of  the  index  k  for  which  the  sign 
<  holds  in  the  relation  Qk"^  (k  +  l)n. 

Now  we  can  formulate  and  prove  the  following  fundamental  theorem : 

2.  Theorem  4 :  If  the  equation  (7)  has  a  solution  of  minimal  degree  e  and 
e  >  0,  then  the  given  pencil  A  +  XB  is  strictly  equivalent  to  a  pencil  of 
the  form 

L,         0 

vO     A  +  XBj' 
where 


(11) 


e  +  l 


L  = 


X     1 
0     X 


0      0 


0  . 

1 


.  0      0 


.    .    .  A      1 


(12) 


and  A-\-  IB  is  a  pencil  of  matrices  for  which  the  equation  analogous  to  (1) 
has  no  solution  of  degree  less  than  e. 

Proof.  We  shall  conduct  the  proof  of  the  theorem  in  three  stages.  First, 
we  shall  show  that  the  given  pencil  A  +  XB  is  strictly  equivalent  to  a  pencil 
of  the  form 


(L,    D  +  XF 
[0     A+XB 


(13) 


where  D,  F,  A,  B  are  constant  rectangular  matrices  of  the  appropriate 
dimensions.  Then  we  shall  establish  that  the  equation  (A  +  XB)x  =  0  has 
no  solution  xiX)  of  degree  less  than  e.  Finally,  we  shall  prove  that  by 
further  transformations  the  pencil  (13)  can  be  brought  into  the  quasi- 
diagonal  form  ( 11 ) . 
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1.  The  first  part  of  the  proof  will  be  couched  in  geometrical  terms. 
Instead  of  the  pencil  of  matrices  A  +  ?.B  we  consider  a  pencil  of  operators 
A-\-  XB  mapping  R„  into  Rm  and  show  that  with  a  suitable  choice  of  bases 
in  the  spaces  the  matrix  corresponding  to  the  operator  A  +  XB  assumes  the 
form  (13). 

Instead  of  (7)  we  take  the  vector  equation 


{A+XB)x=o 
with  the  vector  solution 

the  equations  (9)  are  replaced  by  the  vector  equations 


Axq=o,     Axi=Bxq,     Ax^  =  Bx^,  ...,Ax^=Bx 


e-l 


Bx. 


Below  we  shall  show  that  the  vectors 
Ax^,  Ax2, . . 


Ax, 


(14) 


(15) 


(16) 


(17) 


are  linearly  independent.    Hence  it  will  be  easy  to  deduce  the  linear  inde- 
pendence of  the  vectors 

(18) 


^n.  X-, 


For  since  Axq  =  o  we  have  from    ao  ^o  +  ^i  *i  + ^■  °^s^s  —  ^  that 

aj  -4  :c,  -f  •  — I-  ac  -4  aCe  =  o,  so  that  by  the  linear  independence  of  the  vectors 


0,  since  otherwise  j  a: (2)  would  be 


(17)  ai  =  a2  =  .  .  .  =  ae  =  0.     But  :«o 

a  solution  of  (14)  of  degree  s  —  1,  which  is  impossible.     Therefore  an  =  0 
also. 

Now  if  we  take  the  vectors  (17)  and  (18)  as  the  first  e  +  1  vectors  for 
new  bases  in  Km  and  Rn,  respectively,  then  in  these  new  bases  the  operators 
A  and  B,  by  (16),  will  correspond  to  the  matrices 


e+l 


fi+1 


A  = 


0 

1 

...  0 

*  . . 

.  * 

1 

0  ...  0 

0 

* 

. .  .   * 

0 

0 

1  ...  0 

*  . . 

.  * 

0 

1  ...  0 

0 

* 

.  .  .   * 

6 

6 

. . .  i 

*  . . 

.  * 

,     ^- 

6 

0  ...  1 

6 

* 

.  . .   * 

0 

0 

...  0 

*  . . 

.  * 

0 
0 

0  ...  0 
.  .   .   .  0 

0  ...  0 

0 
0 

0 

* 
* 

* 

. . .   * 

...    * 

0 

0 

...  0 

*  . . 

.  * 

. . .   ♦ 

32  XII.    Singular  Pencils  of  Matrices 

hence  the  A-matrix  A-\-  XB  is  of  the  form  (13).  All  the  preceding  argu- 
ments will  be  justified  if  we  can  show  that  the  vectors  (17)  are  linearly- 
independent.  Assume  the  contrary  and  let  Ax^  (h"^!)  be  the  first  vector 
in  (17)  that  is  linearly  dependent  on  the  preceding  ones : 

Ax^  =  0C^AX^_^  -h  0L2AXj^_^  +  •  •  •  +  (Xh-i^x^ . 

By  (16)  this  equation  can  be  rewritten  as  follows: 
i.e., 

Bx;_^  =  o, 

where 

Furthermore,  again  by  (16), 

^K-l  =  ^  i^h-2  —  <^l^b-3 <^h-2^o)  =  ^K-2  ' 

where 

Continuing  the  process  and  introducing  the  vectors 

we  obtain  a  chain  of  equations 

Bx;_^^o,     Ax*_^=Bxl_^,,.,,Axl  =  Bxl,     Ax^^o.         (19) 

From  (19)  it  follows  that 

x*  W  =  x;  -^xl  +  . ..  +  (-  l)^-X_,     {xl  =x,^o) 

is  a  non-zero  solution  of  (14)  of  degree  ^h  —  1  <  e,  which  is  impossible. 
Thus,  the  vectors  (17)  are  linearly  independent. 

2  We  shall  now  show  that  the  equation  (-4  +  ^B)  x  =ohas  no  solutions 
of  degree  less  than  e.  To  begin  with,  we  observe  that  the  equation  Ley  =  o, 
like  (7),  has  a  non-zero  solution  of  least  degree  e.  We  can  see  this  imme- 
diately, if  we  replace  the  matrix  equation  Le  y=  o  by  the  system  of  ordinary 
equations 
^2/1  +  2/2^0,     ^2/2  +2/3=0,...,  Xy,  +  t/^+i  =0     (y=  {y^,  2/2>  •  •  •  >  ye+i))\ 

2/*  =  (-l)*-'2/iA*-i     (A:  =  l,  2,  ...,£+ 1). 
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On  the  other  hand,  if  the  pencil  has  the  'triangular'  form  (13)  then  the 

corresponding  matrix  pencil  Mjc  (k  =  0,l, e)   (see  (10)  and  (10')  on 

pp.   29  and   30)    can  also  be  brought  into  triangular  form,  after  a  suitable 
permutation  of  rows  and  columns : 


.      ,  (20) 

For  /c  =  e  —  1  all  the  columns  of  this  matrix,  like  those  of  Me-^i  [iye],are 
linearly  independent.^^  But  J/g— i  [i>e]is  a  square  matrix  of  order  e(e  +  1). 
Therefore  in  Me—\  [A  +  ^-S]  also,  all  the  columns  are  linearly  independent 
and,  as  we  have  explained  at  the  beginning  of  the  section,  this  means  that  the 
equation  {A  +  XB)  x  =o  has  no  solution  of  degree  less  than  or  equal  to  e  —  1, 
which  is  what  we  had  to  prove. 


3.    Let  us  replace  the  pencil  (13)  by  the  strictly  equivalent  pencil 
El     Y\IL,     D^?iF\(E^     -X\      (L,     D  +  2.F+Y{A-h  m-LJ^^ 


\0     A  +  XBJXO         eJ      \0  A  +  XB  /'  ^     ^ 


where  Ei,  E2,  E3,  and  E^  are  square  unit  matrices  of  orders  e,  m  —  e,  e  +  1, 
and  n  —  e  —  1,  respectively,  and  X,  Y  are  arbitrary  constant  rectangular 
matrices  of  the  appropriate  dimensions.  Our  theorem  will  be  completely 
proved  if  we  can  show  that  the  matrices  X  and  Y  can  be  chosen  such  that  the 
matrix  equation 

L,X=D  +  XF+Y{A-{-  XB)  (22) 

holds. 

We  introduce  a  notation  for  the  elements  of  2),  F,  X  and  also  for  the 

rows  of  Y  and  the  columns  of  A  and  B : 

D=\\d,,\,    F=\\U,\\,     X=\\x,,\\ 
(i=:l,2,  ...,e;     A;=l,  2,  ...,7i-e-l;     /  =  1,  2,  . . .,  e  +  1) , 

2/2 
7=1    •     I,     A=  {a^,  a^,  . . . ,  a^_,_i) ,     jB  =  (61,  62,  •  •  • »  ^n-s-i) • 


Then  the  matrix  equation  (22)  can  be  replaced  by  a, system  of  scalar  equa- 
tions that  expresses  the  equality  of  the  elements  of  the  A;-th  column  on  the 
right-hand  and  left-hand  sides  of  (22)  (fc  =  1,  2,  . . . ,  w  —  e  —  1)  : 


11  This  follows  from  the  fact  that  the  rank  of  the  matrix  (20)  for  A:=:  £  —  1  is  equal 
to  en;  a  similar  equation  holds  for  the  rank  of  the  matrix  -3fc_i  \Le\ 
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^3k  +  ^^2k  =  ^2*  +  ^f2k  +  2/2^4  +  ^Vzh  y 

^u  +  ^^sk  =  dsk  +  ^hk  +  yz<^k  +  'I2/3&A »  (23) 


^e+l,k  +  ^^ek  =dek  +  ^fek  +  Ve^k  +  ^Vfik 

{k  =  l,2,  ...,n  —  e—l). 

The  left-hand  sides  of  these  equations  are  linear  binomials  in  X.  The 
free  term  of  each  of  the  first  e  —  1  of  these  binomials  is  equal  to  the  coeffi- 
cient of  X  in  the  next  binomial.  But  then  the  right-hand  sides  must  also 
satisfy  this  condition.    Therefore 

2/1%  —  2^2^*  =  /2A  —  ^Ik  » 
2/2«*  —  2/3^*  = /sifc  —  <^2i  > 

2/e-l%  —  Veh  =fek  "  4-1.  fc 

{k  =  l,2,...,n-e-l). 

If  (24)  holds,  then  the  required  elements  of  X  can  obviously  be  determined 
from  (23). 

It  now  remains  to  show  that  the  system  of  equations  (24)  for  the  ele- 
ments of  Y  always  has  a  solution  for  arbitrary  di^  and  fijc  {i  =  l,  2,  .  .  .  ,  e; 
A;  =  l,  2,  . . . ,  n  —  £  —  1).  Indeed,  the  matrix  formed  from  the  coefficients 
of  the  unknown  elements  of  the  rows  2/1,  —  2/2,  2/3,  —  2/4,  ... ,  can  be  written, 
after  transposition,  in  the  form 

e-l 


A 

0  . 

.  .  0 

B 

A 

• 

0 

B 

• 

. 

A 

0 

0  . 

,  B 

But  this  is  the  matrix  Mt-2  for  the  pencil  of  rectangular  matrices  A-\-  XB 
(see  (10')  on  p.  30).  The  rank  of  the  matrix  is  (e  —  1)  {n  —  £  —  1),  be- 
cause the  equation  {A  +  XB)  ^  =  o,  by  what  we  have  shown,  has  no  solutions 
of  degree  less  than  £.  Thus,  the  rank  of  the  system  of  equations  (24)  is 
equal  to  the  number  of  equations  and  such  a  system  is  consistent  (non- 
contradictory)  for  arbitrary  free  terms. 

This  completes  the  proof  of  the  theorem. 
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§  4.     The  Canonical  Form  of  a  Singular  Pencil  of  Matrices 

1.  Let  A  -\-  XB  be  an  arbitrary  singular  pencil  of  matrices  of  dimension 
m  X  n.  To  begin  with,  we  shall  assume  that  neither  among  the  columns 
nor  among  the  rows  of  the  pencil  is  there  a  linear  dependence  with  constant 
coefficients. 

Let  r  <^  n,  where  r  is  the  rank  of  the  pencil,  so  that  the  columns  oi  A  +  XB 
are  linearly  dependent.  In  this  case  the  equation  (A  +  XB)x  =  o  has  a  non- 
zero solution  of  minimal  degree  ei.  From  the  restriction  made  at  the  begin- 
ning of  this  section  it  follows  that  Ci  >  0.  Therefore  by  Theorem  4  the 
given  pencil  can  be  transformed  into  the  form 


0     Ai  +  XB 


.)• 


where  the  equation  {A^^  -f  Wj)  a;<^>  =  o  has  no  solution  x^^^  of  degree  less 
than  ei. 

If  this  equation  has  a  non-zero  solution  of  minimal  degree  £2  (where, 
necessarily,  £2  ^  £1),  then  by  applying  Theorem  4  to  the  pencil  Ai  +  XBi 
we  can  transform  the  given  pencil  into  the  form 

/Le,       0  0  \ 

[0      Ls,         0       I 
\0      0     A^  +  XBj 

Continuing  this  process,  we  can  put  the  given  pencil  into  the  quasi- 
diagonal  form 


K 


P  Ap  +  XBp, 


(25) 


where  0  <  £1  ^  £2  ^  . . .  ^  £p  and  the  equation  {Aj,  4-  XBp)  x^p^  =0  has  no 
non-zero  solution,  so  that  the  columns  of  Ap  +  XBp  are  linearly  independent.^^ 
If  the  rows  of  Ap  +  A5p  are  linearly  dependent,  then  the  transposed 
pencil  A^  -f  XB^pCan  be  put  into  the  form  (25),  where  instead  of  £1,  £2, . . . ,  £p 
there  occur  the  numbers  (0  <)t]^  ^r/^^  >*  -  ^  rj^  .^^  But  then  the  given 
pencil  A  +  XB  turns  out  to  be  transformable  into  the  quasi-diagonal  form 


12  In  the  special  case  where  Ci  +  es  +  .  .  .  +  fu  =  w  the  block  Ap  +  XBp  is  absent. 
^"  Since  no  linear  dependence  with  constaht  coefficients  exists  among  the  rows  of  the 
pencil  A  -\-  XB  and  consequently  of  Ap  +  XBp,  we  have  )^i  >  0. 
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Le,  0 


Le, 


LI 


Ll 


(26) 


0  Ao  +  XBq 

where  both  the  columns  and  the  rows  of  Aq  +  kBo  are  linearly  independent, 
i.e.,  Ao  +  XBo  is  a  regular  pencil.^^ 

2.  We  now  consider  the  general  case  where  the  rows  and  the  columns  of 
the  given  pencil  may  be  connected  by  linear  relations  with  constant  coeffi- 
cients. We  denote  the  maximal  number  of  constant  independent  solutions 
of  the  equations 

U  +  ^B)x  =  o    and     (A'^  +  XB'^)=o 

by  g  and  h,  respectively.  Instead  of  the  first  of  these  equations  we  consider, 
just  as  in  the  proof  of  Theorem  4,  the  corresponding  vector  equation 
(A  +  }.B)x  =  o  (A  and  B  are  operators  mapping  Rn  into  Rm)-  We  denote 
linearly  independent  constant  solutions  of  this  equation  by  ci,  ©2,  . . . ,  c^ 
and  take  them  as  the  first  g  basis  vectors  in  R„.  Then  the  first  g  columns 
of  the  corresponding  matrix  A  +  XB  consist  of  zeros 

9 

A-i-},B  =  (d,Ai  +  ^Bi).  (27) 

Similarly,  the  first  h  rows  of  the  pencil  Ai  +  XBi  can  be  made  into  zeros. 
The  given  pencil  then  assumes  the  form 

MO  0 


(28) 
0     A'^  +  XBV 


^®  If  in  the  given  pencil  r  =  n,  i.e.,  if  the  columns  of  the  pencil  are  linearly  independent, 
then  the  first  p  diagonal  blocks  in  (26)  of  the  form  L,  are  absent  (p^O).  In  the  same 
way,  if  r  =  in,  i.e.,  if  the  rows  of  A  -\- XB  are  linearly  independent,  then  in  (26)  the 
diagonal  blocks  of  the  form  lJ  are  absent  (q=:0). 
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where  there  is  no  longer  any  linear  dependence  with  constant  coefficients 
among  the  rows  or  the  columns  of  the  pencil  A°  +  XB°.  The  pencil  A°  +  ?.B^ 
can  now  be  represented  in  the  form  (26).  Thus,  in  the  general  case,  the 
pencil  A  +  XB  can  always  be  put  into  the  canonical  quasi-diagonal  form 

{no,    Le,^,,...,Lej„    A+i,  . . . ,  Z,J,,    ^o  +  ^^o}-  (29) 

The  choice  of  indices  for  e  and  rj  is  due  to  the  fact  that  it  is  convenient  here 
to  take  e-i^=e2="-=  Sg^O  and  7]^=  ^2  — '  *  *  — ^a  —  ^• 

When  we  replace  the  regular  pencil  ^0  +  ^Bq  in  (29)  by  its  canonical 
form  (6)   (see  §  2,  p.  28),  we  finally  obtain  the  following  quasi-diagonal 
matrix 
9 

HO;    L.,+„...,Ap;    Li:,+„...,L\;    iVr(«>),  . . . ,  i^<«^) ;    J  +  XE),     (30) 

where  the  matrix  J  is  of  Jordan  normal  form  or  of  natural  normal  form  and 

The  matrix  (30)  is  the  canonical  form  of  the  pencil  A-\-  XB  in  the  most 
general  case. 

In  order  to  determine  the  canonical  form  (30)  of  a  given  pencil  imme- 
diately, without  carrying  out  the  successive  reduction  processes,  we  shall, 
following  Kronecker,  introduce  in  the  next  section  the  concept  of  minimal 
indices  of  a  pencil. 


§  5.    The  Minimal  Indices  of  a  Pencil.     Criterion  for 
Strong  Equivalence  of  Pencils 

1.  Let  A  +  XB  be  an  arbitrary  singular  pencil  of  rectangular  matrices. 
Then  the  k  polynomial  columns  Xi{X),  X2{X),  . . . ,  xjc{X)  that  are  solutions 
of  the  equation 

{A  +  XB)x  =  o  (31) 

are  linearly  dependent  if  the  rank  of  the  polynomial  matrix  formed  from 
these  columns  X=  [xi(A),  a;2(A),  . . . ,  XjdX)]  is  less  than  k.  In  that  case 
there  exist  k  polynomials  pi(X),  P2(^),  •  •  • ,  Pk{X),  not  all  identically  zero, 
such  that 

Pi(A)  ar,  W  -f  p^W  x^W  +  '"  +  pM  xM^O. 

But  if  the  rank  of  X  is  k,  then  such  a  dependence  does  not  exist  and  the 
solutions  Xi{X),  X2(X),  . . . ,  Xk(X)  are  linearly  independent. 
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Among  all  the  solutions  of  (31)  we  choose  a  non-zero  solution  xi(}.)  of 
least  degree  Si.  Among  all  the  solutions  of  the  same  equation  that  are  lin- 
early independent  of  Xi(X)  we  take  a  solution  iC2(A)  of  least  degree  €2- 
Obviously",  Ci  ^  £2.  We  continue  the  process,  choosing  among  the  solutions 
that  are  linearly  independent  oi  Xi(X)  and  rc2(^)  a  solution  XsiX)  of  minimal 
degree  £3,  etc.  Since  the  number  of  linearly  independent  solutions  of  (31) 
is  always  at  most  n,  the  process  must  come  to  an  end.  We  obtain  a  funda- 
mental series  of  solutions  of  (31) 

XiW,X2W,...,Xj,W  (32) 

having  the  degrees 

ei^e^^'-'^Sp.  (33) 

In  general,  a  fundamental  series  of  solutions  is  not  uniquely  determined 
(to  within  scalar  factors)  by  the  pencil  A-\-  XB.  However,  two  distinct 
fundamental  series  of  solutions  always  have  one  and  the  same  series  of 
degrees  £1,  £2, . . . ,  fip.  For  let  us  consider  in  addition  to  (32)  another  funda- 
mental series  of  solutions  xi{X),  S2(>i),  ...  with  the  degrees  £1,  £2, 

Suppose  that  in  (33) 

ei  =  *"=e„,<£ni+i  — •••—€»»,<  •  •  • 

and  similarly,  in  the  series  fii,  e^,  . .  • , 

ei=**'=£^x<£«i+i  =  •••=£«,<•  ••. 

Obviously,  £1  =  £1.  Every  column  Xi{X)  (i  =  1,  2,  . . . ,  Wi)  is  a  linear  com- 
bination of  the  columns  Xi{X),  X2(X),  . . . ,  Xn^X),  since  otherwise  the  solu- 
tion a;„,^.i(  A)  in  (32)  could  be  replaced  by  a;i(A),  which  is  of  smaller  degree. 
It  is  obvious  that,  conversely,  every  column  Xi{X)  (i  =  l,  2,  .  .  . ,  n{)  is  a 
linear  combination  of  the  columns  Xi{X),  X2(X),  . . .  ,5«, +  1  (A).  Therefore 
ni=^ni  and  £„,+!  =£';r,+i-  Now  by  a  similar  argument  we  obtain  that 
n2  =  ^2  and  £«,  + 1  =  £n,-f-i>    etc. 

2.  Every  solution  X]c{X)  of  the  fundamental  series  (32)  yields  a  linear 
dependence  of  degree  £&  among  the  columns  ot  A  +  XB  ik  =  l,  2,  . . . ,  p). 
Therefore  the  numbers  £1,  £2,  . . . ,  £p  are  called  the  minimal  indices  for  the 
columns  of  the  pencil  A  +  XB. 

The  minimal  indices  1^1,  7)2, .  • .,  ^g  for  the  rows  of  the  pencil  A-^  XB  are 
introduced  similarly.  Here  the  equation  {A  -{■  XB)x  =  o  is  replaced  by 
(A"""  +  XB'^)y  =  o,  and  rji,  7^2,  -  ■  -  ,  Vq  ^^^  defined  as  minimal  indices  for  the 
columns  of  the  transposed  pencil  A'^  +  XB'^'. 
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Strictly  equivalent  pencils  have  the  same  minimal  indices.  For  let 
A-\-  XB  and  P{A  +  XB)Q  be  two  such  pencils  (P  and  Q  are  non-singular 
square  matrices) .  Then  the  equation  ( 31)  for  the  first  pencil  can  be  written, 
after  multiplication  on  the  left  by  P,  as  follows : 

PiA+kB)Q'Q^x=o, 

Hence  it  is  clear  that  all  the  solutions  of  (31),  after  multiplication  on  the 
left  hy  Q~^,  give  rise  to  a  complete  system  of  solutions  of  the  equation 

P{A  +  XB)Qz=o. 

Therefore  the  pencils  A  +  XB  and  P{A-\-  XB)Q  have  the  same  minimal 
indices  for  the  columns.  That  the  minimal  indices  for  the  rows  also  coincide 
can  be  established  by  going  over  to  the  transposed  pencils. 

Let  us  compute  the  miiimal  indices  for  the  canonical  quasi-diagonal 
matrix 


(        ^  T  T  1 


(34) 


{Ao  +  XBo  is  a  regular  pencil  having  the  normal  form  (6) ). 

We  note  first  of  all  that :  The  complete  system  of  indices  for  the  columns 
{rows)  of  a  quasi-diagonal  matrix  is  obtained  as  the  union  of  the  correspond- 
ing systems  of  minimal  indices  of  the  individual  diagonal  blocks.  The  matrix 
Le  has  only  one  index  e  for  the  columns,  and  its  rows  are  linearly  independ- 
ent. Similarly,  the  matrix  Z^  has  only  one  index  rj  for  the  rows,  and  its 
columns  are  linearly  independent.  Therefore  the  matrix  (34)  has  as  its 
minimal  indices  for  the  columns 

and  for  the  rows 

We  note  further  that  L^  has  no  elementary  divisors,  since  among  its 
minors  of  maximal  order  e  there  is  one  equal  to  1  and  one  equal  to  >i^  The 
same  statement  is,  of  course,  true  for  the  transposed  matrix  Ll.  Since  the 
elementary  divisors  of  a  quasi-diagonal  matrix  are  obtained  by  combining 
those  of  the  individual  diagonal  blocks  (see  Volume  I,  Chapter  VI,  p.  141), 
the  elementary  divisors  of  the  X-matrix  (34)  coincide  with  those  of  its  regular 
'kernel'  Ao  +  XBq. 

The  canonical  form  of  the  pencil  (34)  is  completely  determined  by  the 
minimal  indices  Ej,  .  .  .  ,  e^,  rjx,  -  .  -  ,rjq  and  the  elementary  divisors  of  the 
pencil  or,  what  is  the  same,  of  the  strictly  equivalent  pencil  A  +  XB.    Since 
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two  pencils  having  one  and  the  same  canonical  form  are  strictly  equivalent, 
we  have  proved  the  following  theorem : 

Theorem  5  (Kronecker ) :  Two  arbitrary  pencils  A-\-  XB  and  A^  +  AJ5i 
of  rectangular  matrices  of  the  sa/me  dimension  mX  n  are  strictly  equivalent 
if  and  only  if  they  have  the  same  minimal  indices  and  the  same  {finite  and 
infinite)  elementary  divisors. 

In  conclusion,  we  write  down,  for  purposes  of  illustration,  the  canonical 
form  of  a  pencil  A  +  XB  with  the  minimal  indices  £i  =  0,  £2  =  1,  £3  =  2, 
^1  =  0,  ^^2  =  0,  1/3  =  2  and  the  elementary  divisors  A^,  (A  +  2)^,  ^3  '}^ 


A     1 


A     1     0 
0    A     1 


;  A 

0 

:     1 

A 

;  0 

1 

1 

A 

0 

0 

1 

A 

0 

0 

1 

A 

1  ; 

0 

A; 

A  +  2     1 
0    A  +  2 


(35) 
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1.    Suppose  given  two  complex  quadratic  forms : 


A  (a;,  x)  =  ^  ciik^iX^ 

i,k=l 


k,         B{x,x)=  ^hij,XiX„\ 

i,k=l 


(36) 


they  generate  a  pencil  of  quadratic  forms  A{x,x)  +  XB{x,x).  This  pencil 
of  forms  corresponds  to  a  pencil  of  symmetric  matrices  A  +  AB  {A^  =  A, 
B'^=B).  If  we  sub j ect  the  variables  in  the  pencil  of  forms  A(x,x)  +  XB(x,x) 
to  a  non-singular  linear  transformation  x  =  Tz  (|  T  |  7^  0),  then  the  trans- 
formed pencil  of  forms  A{z,z)  +XB{z,z)  corresponds  to  the  pencil  of 
matrices 


'5  All  the  elements  of  the  matrix  that  are  not  mentioned  expressly  are  zero. 
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A  +  XB  =  T^{A  +  XB)T  ;  (37) 

here  T  is  a  constant  (i.e.,  independent  of  X)  non-singular  square  matrix  of 
order  n. 

Two  pencils  of  matrices  A  +  XB  and  A-\-  XB  that  are  connected  by  a  rela- 
tion (36)  are  called  congruent  (see  Definition  1  of  Chapter  X ;  Vol.  I,  p.  296) . 
Obviously,  congruence  is  a  special  case  of  equivalence  of  pencils  of 
matrices.  However,  if  congruence  of  two  pencils  of  symmetric  (or  skew- 
symmetric)  matrices  is  under  consideration,  then  the  concept  of  congruence 
coincides  with  that  of  equivalence.  This  is  the  content  of  the  following 
theorem. 

Theorem  6:  Two  strictly  equivalent  pencils  of  complex  symmetric  {or 
skew-symmetric)  matrices  are  always  congruent. 

Proof.  Let  A^  A  +  XB  and  A^  A  +  XB  be  two  strictly  equivalent 
pencils  of  symmetric  (skew-symmetric)  matrices: 

A=PAQ    (A-'  =  ±A,  A''  =  ±A',    iP|^0,    |«|7^0).  (38) 

By  going  over  to  the  transposed  matrices  we  obtain : 

A  =  Q''AP^,  (39) 

From  (38)  and  (39),  we  have 

AQP^-^=P~^Q''A.  (40) 

Setting 

V-QP^-\  (41) 

we  rewrite  (40)  as  follows: 

AU=U^A.  (42) 

From  (42)  it  follows  easily  that 

AU^=V''^A      (A;  =  0,  1,  2, ...) 
and,  in  general, 

AS  =  8'' A  ,  (43) 

where 

S=f{U),  (44) 

and  f{X)  is  an  arbitrary  polynomial  in  X.     Let  us  assume  that  this  polj^- 
nomial  is  chosen  such  that  |  ^8^  |  ^.^  0.    Then  we  have  from  (43)  : 

A  =  S''A8-^.  (45) 


42  XII.    Singular  Pencils  of  Matrices 

Substituting  this  expression  for  A  in  (38),  we  have: 

A=PS^AS-^Q.  (46) 

If  this  relation  is  to  be  a  congruence  transformation,  the  following  equa- 
tion must  be  satisfied : 

(P8T  =  S-^Q, 
which  can  be  rewritten  as 

Now  the  matrix  S  =  f(U)  satisfies  this  equation  if  we  take  as  /(a)  the 
interpolation  polynomial  for  ]/A  on  the  spectrum  of  TJ.  This  can  be  done, 
because  the  many -valued  function  fJ  has  a  single-valued  branch  determined 
on  the  spectrum  of  TJ,  since  |  C/"  |  =7^  0. 

The  equation  (46)  now  becomes  the  condition  for  congruence 

A=T^AT     {T=^SQ=iQF^Q).  (47) 

From  this  theorem  and  Theorem  5  we  deduce : 
Corollary  :    Two  pencils  of  quadratic  forms 

A  {x,  x)  +  ?.B  {x,  x)     and     A  (z,  z)  +  ^B  (z,  z) 

can  he  carried  into  one  another  hy  a  transformation  x'=Tz  (|  T  |  7^  0)  if 
and  only  if  the  pencils  of  symmetric  matrices  A-\-  XB  and  A-h  XB  have  the 
same  elementary  divisors  (finite  and  infinite)  and  the  same  minimal  indices. 
Note.  For  pencils  of  symmetric  matrices  the  rows  and  columns  have  the 
same  minimal  indices : 

p  =  q;     Si  =  r]i,.,.,Sj,=  'np.  (48) 

2.  Let  us  raise  the  following  question :  Given  two  arbitrary  complex  quad- 
ratic forms 

n  n 

A  (x,  x)=  ^  (^ik^iX/c ,         B  {x,  x)  =  ^  ^ik^i^k ' 

i,k=l  i,k=l 

Under  what  conditions  can  the  two  forms  be  reduced  simultaneously  to 
sums  of  squares 

n  n 

^  aiz\     and      ^hf]  (49) 

1=1  i=l 

hy  a  non-singular  transformation  of  the  variables  x  =  Tz  ( |  T  |  ^^  0)  ? 
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Let  us  assume  that  the  quadratic  forms  A{x,x)  and  B(x,x)  have  this 
property.  Then  the  pencil  of  matrices  A  +  ^B  is  congruent  to  the  pencil 
of  diagonal  matrices 

{ai  +  ;i6i,  a^  +  A62,  ...,«„  +  A6„} .  (50) 

Suppose  that  among  the  diagonal  binomials  ai  +  A&i  there  are  precisely  r 
(r^n)  that  are  not  identically  zero.  Without  loss  of  generality  we  can 
assume  that 

ai=bi=0,,..,a„_=b^_r=0,  a,.  +  A6..^0     (i  =  n-r+l,  ...,n).   (51) 

Setting  ^ 

Ao  +  ABo  =  {«„_,+!  +  A6„_,+i,  ...,«„  +  A6J ,  (52) 

we  represent  the  matrix  (51)  in  the  form 


{0,     Ao  +  XBo),  (53) 

Comparing  (52)  with  (34)  (p.39) ,  we  see  that  in  this  case  all  the  minimal 
indices  are  zero.  Moreover,  all  the  elementary  divisors  are  linear.  Thus 
we  have  obtained  the  following  theorem : 

Theorem  7 :  Two  quadratic  forms  Aix,x)  and  B{x,x)  can  he  reduced 
simultaneously  to  sums  of  squares  (49)  hy  a  transformation  of  the  variables 
if  and  only  if  in  the  pencil  of  matrices  A  +  XB  all  the  elementary  divisors 
{finite  and  infinite)  are  linear  and  all  the  minimal  indices  are  zero. 

In  order  to  reduce  two  quadratic  forms  A(x,x)  and  B{x,x)  simulta- 
neously to  some  canonical  form  in  the  general  case,  we  have  to  replace  the 
pencil  of  matrices  A  +  XB  hy  a  strictly  equivalent  '  canonical '  pencil  of 
symmetric  matrices. 

Suppose  the  pencil  of  symmetric  matrices  A  -\-  XB  has  the  minimal  indices 
£1  =  . . .  =  e^  =  0,  Sg  +  i  y^O,  .  . . ,  Spy^O,  the  infinite  elementary  divisors 
/*"S  ^"«,  .  .  .  ,  //"^  and  the  finite  ones  {2.  +  X^)'^,  {X  +  /t^Y', .  .  .  ,(^  +  XfYt  Then, 
in  the  canonical  form  (30),  g  =  h,p  =  q  and  e^+i  =  rjg+i,  .  .  . ,  ep  =  rjp.  We 
replace  in  (30)  every  two  diagonal  blocks  of  the  form  Le  and  Lj  by  a  single 

diagonal  block  1^   ^^\  and  each  block  of  the  form  iV^"^  =  ^^«^+;fi^(«)by  the 
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strictly  equivalent  symmetric  block 


2f{u)=y(u)]^(u): 


0     0  ...  0     1 

0  0  ...  1     A 

1  A  ...  0     0 


0 

0  ...  0 

1 

0 

0  ...   1 

0 

with    F<«)  = 

• 

. 

1  * 

. 

I 

0  ...  0 

0 

(54) 


Moreover,  instead  of  the  regular  diagonal  block  J  +  XE  in  (30)    (J  is  a 
Jordan  matrix) 

J  +  kE  =  {{X  +  Ai)  E^'^^  +  ^<^'>, . . . ,  (A  +  A,)  ^(*^>  +  m^^^} , 

we  take  the  strictly  equivalent  block 

{Z<;'',...,Zf},  (55) 


where 


Zif  =  V^'i^  [{X  +  A,.)  E^'i>  +  H^'*^] 


0      X  +  Xi 

X  +  Xi      1 


(i  =  l,2,...,0.  (56) 


X  +  Xi         1   ....   0 
The  pencil  A  +  AS  is  strictly  equivalent  to  the  symmetric  pencil 


A  +  XB 


'KJ-") (1  ?)••*■■' "-^"^ *)■ 


(57) 


Two  quadratic  forms  with  complex  coefficients  A{x,x)  and  B(x,x)  can 
he  simultaneously  reduced  to  the  canonical  forms  A{z,z)  and  B(z,z)  defined 
in  (57)  hy  a  transformation  of  the  variables  x  =  Tz  {\  T  \y^0).^'' 


1^  In  the  Russian  edition  the  author  stated  that  propositions  analogous  to  Theorems  6 
and  7  hold  for  hermitian  forms.  A.  I.  Mai  'cev  has  pointed  out  to  the  author  that  this  is 
not  the  case.    As  regards  singular  pencils  of  hermitian  forms,  see  [197  II]. 
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§  7.     Application  to  Differential  Equations 

1.  The  results  obtained  will  now  be  applied  to  a  system  of  m  linear  differ- 
ential equations  of  the  first  order  in  n  unknown  functions  with  constant 
coefficients  :^^ 

2:a,,x,+  Jjb,,^  =  f,(t)      (i=l,2,  ...,m),  (58) 


Ax  +  B~=f{t);  (59) 

here^® 


or  in  matrix  notation : 

dt 

^=||a,;,||,     B  =  \\b,,\\      (1  =  1,2,  ...,m;  A;-l,2,...,n), 
x={xi,  x^,  ...,  xj ,     /  ==(/i,  /a,  . . . ,  /J  • 

We  introduce  new  unknown  functions  Zi,  22,  •  •  • ,  2!n  that  are  connected 
with  the  old  Xi,  X2,  .  . . ,  Xn  hy  a  linear  non-singular  transformation  with 
constant  coefficients: 

x=Qz     {z  =  {z,,z^,,..,zj;  \Q\y^O).  (60) 

Moreover,  instead  of  the  equations  (58)  we  can  take  m  arbitrary  inde- 
pendent combinations  of  these  equations,  which  is  equivalent  to  multiplying 
the  matrices  A,  B,  f  on  the  left  by  a  square  non-singular  matrix  P  of  order  m. 
Substituting  Qz  for  ic  in  (59)  and  multiplying  (59)  on  the  left  by  P,  we 
obtain : 

Az+Bp^=ht),  (61) 

where 

A=PAQ,     B  =  PBQ,     /'=P/=(/;,/2,  ...Jn).  (62) 

The  matrix  pencils  A  +  XB  and  A-]-  XB  are  strictly  equivalent : 

A  +  XB  =  P{A  +  XB)Q.  (63) 

We  choose  the  matrices  P  and  Q  such  that  the  pencil  A-\-  XB  has  the 
canonical  quasi-diagonal  form 


18  The  particular  case  where  m  =  n  and  the  system  (58)  is  solved  with  respect  to  the 
derivatives  has  been  treated  in  detail  in  Vol.  I,  Chapter  V,  §  5. 

It  is  well  known  that  a  system  of  linear  differential  equations  with  constant  coeffi- 
cients of  arbitrary  order  s  can  be  reduced  to  the  form  (58)  if  all  the  derivatives  of  the 
unknown  functions  up  to  and  including  the  order  s  —  1  are  included  as  additional  unknown 
functions. 

19  w^e  recall  that  parentheses  denote  column  matrices.  Thus,  x^  (xi,  X2,  .  .  .  ,  Xn)  is  the 
column  with  the  elements  Xi,  X2,  . .  .  ,  Xn. 
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A  +  XB={0,  Le,^,,  . . . ,  Le,,  L\^„  . . . ,  Ll,^,  N(-^\  . . . ,  N(-^\  J  +  XE]  •     (64) 

In  accordance  with  the  diagonal  blocks  in  (64)  the  system  of  differential 
equations  splits  into  v  =  p  —  g  +  q  —  h  +  s  +  2  separate  systems  of  the  form 


O'Z^f 


(65) 


.T      /d\p-ff+'i-+i 


U) 


p-9+l+j 


z        =        / 


(i=l,2,  ...,p-j),        (66) 


(/=1,2,  ...,2-A),        (67) 


^'"*'0 


p-^+<?-A+l+* 


2  =  /  (A:  =  l,2,  ...,5) 


where 


(•^+^)-/ 


z  = 


z  =  (^1,  . . . ,  Zg),  f=  (71,  . . . ,  A),  z=  (2^+1,  . . .),  f  =  ih+u  ' ' ')     etc., 


(68) 
(69) 


(70) 


(71) 
(72) 


Thus,  the  integration  of  the  system  (59)  in  the  most  general  case  is 
reduced  to  the  integration  of  the  special  systems  (65) -(69)  of  the  same  type. 
In  these  systems  the  matrix  pencil  A  +  XB  has  the  form  0,  Le,  L^y  iV^'«^ ,  and 
J  +XE,  respectively. 

1)   The  system  (65)  is  not  inconsistent  if  and  only  if 


I.e. 


7,^0,...,  7,^0. 


(73) 
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In  that  case  we  can  take  arbitrary  functions  of  t  as  the  unknown  functions 

1 
01,  Z2,  . . . ,  Zg  that  form  the  columns  z. 

2)   The  system  (66)  is  of  the  form 


''^{li)'='f 


(74) 
or,  more  explicitly,^® 

^  +  z^  =  I  it).  ^  +  z^  =  U{t) ^  +  2.+1  =  Ml) .  (75) 

Such  a  system  is  always  consistent.  If  we  take  for  Z£+i(t)  an  arbitrary 
function  of  t,  then  all  the  remaining  unknown  functions  Ze ,  Zg-i,  .  .  . ,  Zi  can 
be  determined  from  (75)  by  successive  quadratures. 

3)  The  system  (67)  is  of  the  form 

or,  more  explicitly,^' 

T«  =  ^iW'  ^  +  %  =  /2(').---.  ^  +  ^',-i  =  f~r,{t),  5,=/;+i«).     (77) 

From  all  the  equations  (77)  except  the  first  we  determine  Ztj,  Zjj-i, . . . ,  Zi 
uniquely : 


z„=/ 


n+ii 


rr  =7         ^/^  +  1 

^''-^      ^^        dt    '  (78) 


Substituting  this  expression  for  Zi  into  the  first  equation,  we  obtain  the 
condition  for  consistency: 


20  We  have  changed  the  indices  of  z  and  /  to  simplify  the  notation.  In  order  to  return 
from  (75)  to  {QQ)  we  have  to  replace  £  by  £i  and  add  to  each  index  of  z  the  number 
9  +  e^+i  H |-e^-|-i_i  +  t  — 1  ,  to  each  index  of  /  the  number  h  +  Sg^^  -\ h  e^+i— i  • 

2^  Here,  as  in  the  preceding  case,  ^Ye  have  changed  the  notation.  See  the  preceding 
footnote. 
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4)  The  system  (68)  is  of  the  form 

iV(«)(^).  =  7  (80) 

or,  more  explicitly, 

Hence  we  determine  successively  the  unique  solutions 

z       =1      -^ 
«*-!      '«-!       d<  '  (82) 

^     '1      d«  +  dt^  ^  ^    ^^       dt«-^  ' 

5)   The  system  (69)  is  of  the  form 

J^  +  Pt=l  (83) 

As  we  have  proved  in  Vol.  I,  Chapter  V,  §  5,  the  general  solution  of  such 
a  system  has  the  form 

t 
z  =  e-^'zo  +  /  e--^  <*-">/  (t)  dr;  (84) 

0 

here  Zo  is  a  column  matrix  with  arbitrary  elements  (the  initial  values  of 
the  unknown  functions  for  ^  =  0) . 

The  inverse  transition  from  the  system  (61)  to  (59)  is  effected  by  the 
formulas  (60)  and  (62),  according  to  which  each  of  the  functions  Xi,  . .  . ,  Xn 
is  a  linear  combination  of  the  functions  Zi,  .  . . ,  Zn  and  each  of  the  functions 
/i(0,  •  •  • ,  /m(0  is  expressed  linearly  (with  constant  coefficients)  in  terms 
of  the  functions  fi(t),  .  .  . ,  fm(t). 

2.  The  preceding  analysis  shows  that :  In  general,  for  the  consistency  of  the 
system  (58)  certain  well-defined  linear  dependence  relations  (with  constant 
coefficients)  must  hold  among  the  right-hand  sides  of  the  equations  and  the 
derivatives  of  these  right-hand  sides. 

If  these  relations  are  satisfied,  then  the  general  solution  of  the  system 
contains  both  arbitrary  constants  and  arbitrary  functions  linearly. 

The  character  of  the  consistency  conditions  and  the  character  of  the 
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solutions  (in  particular,  the  number  of  arbitrary  constants  and  arbitrary 
functions)  are  determined  by  the  minimal  indices  and  the  elementary  divi- 
sors of  the  pencil  A  +  XB,  becaiise  the  canonical  form  (65) -(69)  of  the  sys- 
tem of  differential  equations  depends  on  these  minimal  indices  and  ele- 
mentary divisors. 


CHAPTER    XIII 

MATRICES  WITH  NON-NEGATIVE  ELEMENTS 

In  this  chapter  we  shall  study  properties  of  real  matrices  with  non-negative 
elements.  Such  matrices  have  important  applications  in  the  theory  of 
probability,  where  they  are  used  for  the  investigation  of  Markov  chains 
('stochastic  matrices/  see  [46]),  and  in  the  theory  of  small  oscillations  of 
elastic  systems  ('oscillation  matrices,'  see  [17]). 

§  1.    General  Properties 

1.    We  begin  with  some  definitions. 

Definition  1 :    A  rectcmgular  matrix  A  with  real  elements 
A=\\aij,\\         {i=l,2,  ...,m;k  =  1,2,  ...,n) 

is  called  non-negative  (notation:  A^O)  or  positive  (notation:  A  >  0)  if 
all  the  elements  of  A  are  non-negative  (o^fc  ^  0)  or  positive  (fl^fc  >  0). 

Definition  2:  A  square  matrix  ^=11  <^ik  11 1  is  called  reducible  if  the 
index  set  1,  2,  .  .  . ,  n  can  he  split  into  two  complementary  sets  (without  com- 
mon indices)  ii,  12,  . . . ,  i/i;  ki,  k2,  . . . ,  kp  {fi  +  v  =  n)  such  that 


ha.kp 


0   (a  =  l,2,...,//;i?  =  l,2,...,r). 


Otherwise  the  matrix  is  called  irreducible. 

By  a  permutation  of  a  square  matrix  A  =  11  am  11 "  we  mean  a  permutation 
of  the  rows  of  A  combined  with  the  same  permutation  of  the  columns. 

The  definition  of  a  reducible  matrix  and  an  irreducible  matrix  can  also 
be  formulated  as  follows : 

Definition  2':  A  matrix  A=  ||  o^fc  ||i  is  called  reducible  if  there  is  a 
permutation  that  puts  it  into  the  form 


<  :)■ 


where  B  and  D  are  square  matrices.    Otherwise  A  is  called  irreducible. 
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Suppose  that  A=  11  a.(fc  ||r  corresponds  to  a  linear  operator  A  in  an 
7i-dimensional  vector  space  R  with  the  basis  Ci,  €2, . .  . ,  Cn-  To  a  permutation 
of  A  there  corresponds  a  renumbering  of  the  basis  vectors,  i.e.,  a  transition 
from  the  basis  ei,  ©2, .  . . ,  e„  to  a  new  basis  e\  =  e^-,,  e'g  "=  %>  ••-,«'„  =  e^„  , 
where  {ji,  32, .  .  . ,  jn)  is  a  permutation  of  the  indices  1,  2, .  .  .  ,  n.  The  matrix 
A  then  goes  over  into  a  similar  matrix  A  =  T~^AT.  (Each  row  and  each 
column  of  the  transforming  matrix  T  contains  a  single  element  1,  and  the 
remaining  elements  are  zero.) 

2.    By  a  v-dimensional  coordinate  suhspace  of  R  we  mean  a  subspace  of  R 

with  a  basis  e^.^,  e^^,  .  . . ,  ej^^  (1  ^  A^i  <  A:2  <  .  .  .  <  A^^  ^  n) .    There  are  p  J 

v-dimensional  coordinate  subspaces  of  R  connected  with  a  given  basis 
Ci,  €2,  . . . ,  c„.  The  definition  of  a  reducible  matrix  can  also  be  given  in  the 
following  form : 

Definition  2":  A  matrix  ^  =  11  cluc  11  i  is  called  reducible  if  and  only  if 
the  corresponding  operator  A  has  a  v-dimensional  invariant  coordinate  suh- 
space with  1/  <  71. 

We  shall  now  prove  the  following  lemma : 

Lemma  1 :    If  A^O  is  an  irreducible  matrix  of  order  n,  then 

{E-\-A)*^--'>0.  (1) 

Proof.  For  the  proof  of  the  lemma  it  is  sufficient  to  show  that  for  every 
vector^  (i.e.,  column)  y^o  (y  y^  0)  the  inequality 

{E  +  A)*'-'^y  >  o 
holds. 

This  inequality  will  be  established  if  we  can  only  show  that  under  the 
conditions  y^o  and  y  7^  0  the  vector  z=  {E  -\-  A)y  always  has  fewer  zero 
coordinates  than  y  does.  Let  us  assume  the  contrary.  Then  y  and  z  have 
the  same  zero  coordinates.^  Without  loss  of  generality  we  may  assume  that 
the  columns  y  and  z  have  the  f orm^ 


^  Here  and  throughout  this  chapter  we  mean  by  a  vector  a  column  of  n  numbers.  In 
this  way  we  identify,  as  it  were,  a  vector  with  the  column  of  its  coordinates  in  that  basis 
in  which  the  given  matrix  A  =  \\  aik  |^  determines  a  certain  linear  operator. 

-  Here  we  start  from  the  fact  that  z  =  y  -{-  Ay  and  Ay^o;  therefore  to  positive 
coordinates  of  y  there  correspond  positive  coordinates  of  z. 

3  The  columns  y  and  2  can  be  brought  into  this  form  by  means  of  a  suitable  renumber- 
ing of  the  coordinates  (the  same  for  y  and  z). 
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y  =  i      j,    2  =  (      J   (w>o,  v>o), 

where  the  columns  u  and  v  are  of  the  same  dimension. 
Setting 


21       ^22 

we  have 


\^21       -^22/ 
/w\_^Mll       ^12\/^\         /^ 

\o/      U21     ^22/U/      U 


and  hence 

A^^u  =  0 . 

Since  u  >  0,  it  follows  that 

^21  =  0. 

This  equation  contradicts  the  irreducibility  ot  A. 
Thus  the  lemma  is  proved. 
We  introduce  the  powers  of  A : 

^'=ll4'lli   (2=1,2,...). 

Then  the  lemma  has  the  following  corollary : 

Corollary  :    If  A  "^0  is  an  irreducihle  matrix,  then  for  every  index 
pair  i,k  (1  ^  i,  A;  ^  n)  there  exists  a  positive  integer  q  such  that 

4'>0.  (2) 


Hk 

Moreover,  q  can  always  he  chosen  within  the  hounds 

q-^m  —  1  if  i^k,  | 
q'^m  if  i  =  k,  I 


(3) 


where  m  is  the  degree  of  the  minimal  polynomial  \p{X)  of  A. 

For  let  r(A)  denote  the  remainder  on  dividing  (/i  +  l)"~^  by  'ip{l). 
Then  by  (1)  we  have  r{A)  >  0.  Since  the  degree  of  r(A)  is  less  than  m, 
it  follows  from  this  inequality  that  for  arbitrary  i,k  (1  '^  i,  k  -^  n)  at  least 
one  of  the  non-negative  numbers 

c  (2)  (ntr-l) 

^ik'  ^iki  ^ik  y   •  •  •  »  "tJfc 

is  not  zero.     Since  <5ifc  =  0  for  iy^k,  the  first  of  the  relations  (3)  follows. 
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The   other   relation    (for  i  =  k)    is  obtained   similarly   if   the   inequality 
r(A)  >  0  is  replaced  by  Ar{A)  >  0.^ 

Note.    This  corollary  of  the  lemma  shows  that  in  (1)  the  number  n  —  1 
can  be  replaced  by  m  —  1. 


§  2.     Spectral  Properties  of  Irreducible  Non-negative  Matrices 

1.    In  1907  Perron  found  a  remarkable  property  of  the  spectra  (i.e.,  the 
characteristic  values  and  characteristic  vectors)  of  positive  matrices.^ 

Theorem  1  (Perron)  :  A  positive  matrix  ^=\\  cbiu  111  always  has  a  real 
and  positive  characteristic  value  r  which  is  a  simple  root  of  the  characteristic 
equation  and  exceeds  the  moduli  of  all  the  other  characteristic  values.  To  this 
*maximaV  characteristic  value  r  there  corresponds  a  characteristic  vector 
z=  (01,  Z2,  . . . ,  Zn)  of  A  with  positive  coordinates  0i  >  0  (i=  1,  2,  . . . ,  w).^ 

A  positive  matrix  is  a  special  case  of  an  irreducible  non-negative  matrix. 
Frobenius''  has  generalized  Perron's  theorem  by  investigating  the  spectral 
properties  of  irreducible  non-negative  matrices. 

Theorem  2  (Frobenius) :  An  irreducible  non-negative  matrix  A  = 
II  (hk  II?  always  has  a  positive  characteristic  value  r  thai  is  a  simple  root  of 
the  characteristic  equation.  The  moduli  of  all  the  other  characteristic  values 
do  not  exceed  r.  To  the  *  maximal'  characteristic  value  r  there  corresponds 
a  characteristic  vector  with  positive  coordinates. 

Moreover,  if  A  has  h  characteristic  values  Ao  =  r,  Ai, . . . ,  Xji-i  of  modulus 
r,  then  these  numbers  are  all  distinct  and  are  roots  of  the  equation 

AA_r*  =  0,  (4) 

More  generally.   The  whole  spectrum  lo,  h,  ■  -  ■ ,  K-i  of  A,  regarded  as  a 
system  of  points  in  the  complex  X-plane,  goes  over  into  itself  under  a  rotation 


4  The  product  of  an  irreducible  non-negative  matrix  and  a  positive  matrix  is  itself 
positive. 

5  See  [316],  [317],  and  [17],  p.  100. 

^  Since  r  is  a  simple  characteristic  value,  the  characteristic  vector  z  belonging  to  it  is 
determined  to  vi^ithin  a  scalar  factor.  By  Perron's  theorem  all  the  coordinates  of  z  are 
r«al,  different  from  zero,  and  of  like  sign.  By  multiplying  0  by  —  1,  if  necessary,  we 
can  make  all  its  coordinates  positive.  In  the  latter  case  the  vector  (column)  z^  (^i,  22, 
03,  ... ,  Zn)  is  called  positive  (as  in  Definition  1). 

7  See  [165]  and  [166]. 
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of  the  plane  by  the  angle  27i/h.    7/  /i  >  1,  then  A  can  he  put  hy  means  of  a 
permutation  into  the  following  ^cyclic'  form: 

0  A^^O  ...  0 
0  0  A^^...  0 
A^\       i,  (5) 

0  0  0     ...  ^,_i,, 
.A^^O  0     ...0 

where  there  are  square  blocks  along  the  main  diagonal. 

Since  Perron's  theorem  follows  as  a  special  case  from  Frobenius' 
theorem,  we  shall  only  prove  the  latter.^  To  begin  with,  we  shall  agree  on 
some  notation. 

We  write 

0  ^  2)  or  D  ^  C, 

where  C  and  D  are  real  rectangular  matrices  of  the  same  dimensions  mX  n 
C  =  \\Ci,\\,D=]\di,\\     (i=l,2,  ...,m;  k  =  l,  2,  . . .,  n), 

if  and  only  if 

%  ^  dij,     (i  =  1,  2,  . . .,  m;  k  =  l,  2,  . . .,  n)  .  (6) 

If  the  equality  sign  can  be  omitted  in  all  the  inequalities  (6),  then  we 
shall  write 

C  <D  or  D>  C. 

In  particular,  C  ^  0  ( >  0 )  means  that  all  the  elements  of  C  are  non- 
negative  (positive). 

Furthermore,  we  denote  by  0+  the  matrix  mod  C  which  arises  from  C 
when  all  the  elements  are  replaced  by  their  moduli. 

2.    Proof  of  Frohenius^  Theorem  :^    Let  x=  {xi,  X2,  .  ■  ■ ,  Xn)   {x  j^  o)  he  a 
fixed  real  vector.    We  set : 

k=  1 

In  the  definition  of  the  minimum  we  exclude  here  the  values  of  i  for  which 
Xi  =  0.    Obviously  r^  ^  0,  and  r^,  is  the  largest  real  number  q  for  which 

QX  ^  Ax. 


^  For  a  direct  proof  of  Perron's  theorem  se©  [17],  p.  100  ff. 
9  This  proof  is  due  to  Wielandt  [384]. 
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We  shall  show  that  the  function  Vx  assumes  a  maximum  value  r  for  some 
vector  z'^o: 

r  =  r2=maxr^  =  max  mm   ^^ — -.  (7) 

From  the  definition  of  r^  it  follows  that  on  multiplication  of  a  vector 
x^o  {xy^o)  by  a  number  >i  >  0  the  value  of  r^  does  not  change.  There- 
fore, in  the  computation  of  the  maximum  of  r^  we  can  restrict  ourselves  to 
the  closed  set  M  of  vectors  x  for  which 

n 

x'^0     and     {xx)  ^  ^  a;?  =  1 . 
t=i 

If  the  function  Vx  were  continuous  on  M,  then  the  existence  of  a  maximum 
would  be  guaranteed.  However,  though  continuous  at  every  '  point '  x  >  o, 
Vj;  may  have  discontinuities  at  the  boundary  points  of  M  at  which  one  of  the 
coordinates  vanishes.  Therefore,  we  introduce  in  place  of  M  the  set  N  of 
all  the  vectors  y  of  the  form 

y=  {E-\-  AY-'^x     {xeM). 

The  set  N,  like  M,  is  bounded  and  closed  and  by  Lemma  1  consists  of 
positive  vectors  only. 

Moreover,  when  we  multiply  both  sides  of  the  inequality 

VxX"^  Ax, 
by  (E  +  A)«-i  >  0,  we  obtain : 

r^y^Ay         (y  =  (E  +  A)"-''x) . 

Hence,  from  the  definition  of  r«  we  have 


Therefore  in  the  computation  of  the  maximum  of  r^  we  can  replace  M 
by  the  set  iV^  which  consists  of  positive  vectors  only.  On  the  bounded  and 
closed  set  N  the  function  r^.  is  continuous  and  therefore  assumes  a  largest 
value  for  some  vector  2;  >  o. 

Every  vector  z^  0  for  which 

r^  =  r  (8) 

will  be  called  extremal. 
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We  shall  now  show  that:  1)  The  number  r  defined  by  (7)  is  positive 
and  is  a  characteristic  value  of  A;  2)  Every  extremal  vector  z  is  positive 
and  is  a  characteristic  vector  of  A  for  the  characteristic  value  r,  i.e., 

r>0,  z>o,  Az  =  rz.  (9) 

n 

For  if  i^=  (1,  1,  . .  .  ,  1),  then  r^=  min   ^  a^^.    But  then  r„  >0,  be- 

cause  no  row  of  an  irreducible  matrix  can  consist  of  zeros  only.  Therefore 
r  >  0,  since  r  ^  r„.    Now  let 

x={E-\-Af-^z.  (10) 

Then,  by  Lemma  1,  ic  >  o.  Suppose  that  Az  —  rzj^  o.  Then  by  (1),  (8), 
and  (10)  we  obtain  successively: 

Az  —  rz'^ o,  {E  +  AY-^  {Az  —  rz)>o,  Ax  —  rx>o. 

The  last  inequality  contradicts  the  definition  of  r,  because  it  would  imply 
that  Ax  —  (r  +  e)x  >  0  for  sufficiently  small  £  >  0,  i.e.,  r^^r  +  e  '^  r. 
Therefore  Az  =  rz.    But  then 

o<x  =  {E  +  A)"^^  z  =  ( 1  +  r)"-i  z, 
so  that  2;  >  0. 

We  shall  now  show  that  the  moduli  of  all  the  characteristic  values  do  not 
exceed  r.    Let 

Ay  =  (xy  {yy^o).  (11) 

Taking  the  moduli  of  both  sides  in  (11),  we  obtain  :^° 

\oc\y+^Ay+.  (12) 

Hence 

\a\^r^+^r. 
Let  y  be  some  characteristic  vector  corresponding  to  r : 

Ay  =  ry     {y^o). 

Then  setting  a  =  r  in  (11)  and  (12)  we  conclude  that  y+  is  an  extremal 
vector,  so  that  y+  >  0,  i.e.,  y=  (yi,  1/2, ... , :?/«),  where  yiy^O  {i=l,  2,...,n). 
Hence  it  follows  that  only  one  characteristic  direction  corresponds  to  the 
characteristic  value  r  ;  for  if  there  were  two  linearly  independent  character- 
istic vectors  z  and  Zi,  we  could  chose  numbers  c  and  d  such  that  the  char- 
acteristic vector  y  =  cz  +  dzi  has  a  zero  coordinate,  and  by  what  we  have 
shown  this  is  impossible. 


Eegarding  the  notation  y  +  ,  see  p.  54. 
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We  now  consider  the  adjoint  matrix  of  the  characteristic  matrix  XE  —  A: 

B  (A)  =  II  £,,  (A)  \\'i  =  A  (A)  (XE  -A)-\ 

where  zl(A)  is  the  characteristic  polynomial  of  A  and  BijdX)  the  algebraic 
complement  of  the  element  ^du  —  Qm  in  the  determinant  A(X).  From  the 
fact  that  only  one  characteristic  vector  z=  {zi,  Z2,  .  .  . ,  Zn)  with  Zi  >  0, 
^2  >  0,  .  .  .  ,  s„  >  0  corresponds  to  the  characteristic  value  r  (apart  from  a 
factor)  it  follows  that  B{r)  y^  0  and  that  in  every  non-zero  column  oi  B{r) 
all  the  elements  are  different  from  zero  and  are  of  the  same  sign.  The  same 
is  true  for  the  rows  of  B(r),  since  in  the  preceding  argument  A  can  be  re- 
placed by  the  transposed  matrix  ^4.""".  From  these  properties  of  the  rows 
and  columns  of  A  it  follows  that  all  the  Budr)  {i,  k  =  l,  2,  .  .  .  ,  n)  are 
different  from  zero  and  are  of  the  same  sign  a.    Therefore 

n 

aA'{r)  =  a2JBii{r)>0, 

i  =  l 

i.e.,  A'(r)  j^O  and  r  is  a  simple  root  of  the  characteristic  equation  A{1)  —  0. 
Since   r  is  the   maximal  root  of   A{X)  =  X^  -\-  .  .  .  ,  A{X)    increases   for 
k^r.    Therefore  zJ'(r)  >  0  and  (7=1,  i.e., 

B,,{r)>0     {i,k  =  l,2,...,n),  (13) 

3.  Proceeding  now  to  the  proof  of  the  second  part  of  Frobenius'  theorem, 
we  shall  make  use  of  the  following  interesting  lemma  :^^ 

Lemma  2:    //  A==  ||  aik  ||i  and  C=  ||  Cijc  ||i   are  two  square  matrices  of 
the  same  order  n,  where  A  is  irreducible  and^^ 

C+^A,  (14) 

then  for  every  characteristic  value  y  of  C  and  the  maximal  characteristic 
value  r  of  A  we  have  the  inequality 

\y\^r.  (15) 

In  the  relation  (15)  the  equality  sign  holds  if  and  only  if 

C  =  e^'PDAD-^,  (16) 

where  e^'f=  y/r  and  D  is  a  diagonal  matrix  whose  diagonal  elements  are  of 
unit  modulus  (D+  =E). 


11  See  [384]. 

^-  C  is  a  complex  matrix  and  A  ^  0. 
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Proof.  We  denote  by  y  a  characteristic  vector  of  C  corresponding  to  the 
characteristic  value  y : 

Cy  =  yy  (y¥=0).  (17) 

From  (14)  and  (17)  we  find 

\y\y+^C-^y^^Ay+.  (18) 

Therefore 

\y\^r^+^r. 

Let  us  now  examine  the  case  |  y  |  =  r  in  detail.  Here  it  follows  from 
(18)  that  y+  is  an  extremal  vector  for  A,  so  that  i/+  >  o  and  that  i/+  is  a 
characteristic  vector  of  A  for  the  characteristic  value  r.  Therefore  the 
relation  (18)  assumes  the  form 

Ay+^C+y+=ry+,     y^>o,  (19) 

Hence  by  (14) 

C-^  =  A,  (20) 

Let  y=(yi,y2,'-',  yn),  where 

2//^|2/;|e»>/         (/=1,2,  ...,7^). 
"We  define  a  diagonal  matrix  D  by  the  equation 

D  =  {e*>se*>S  . . .,  e*>»}. 
Then 

y  =  X)y+. 

Substituting  this  expression  for  y  in  (17)  and  then  setting  y  =  re''f,  we 
find  easily : 

Fy^=ry+  (21) 

where 

F=e-^^D-^CD.  (22) 

Comparing  (19)  with  (21),  we  obtain 

Fy+  =  G+y+=Ay+.  (23) 

But  by  (22)  and  (20) 

F+=C+=A, 
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Therefore  we  find  from  (23) 

Since  i/+  >  o,  this  equation  can  hold  only  if 

F  =  F^, 

i.e., 

Hence 

and  the  Lemma  is  proved. 

4.  We  return  to  Frobenius'  theorem  and  apply  the  lemma  to  an  irreducible 
matrix  A^O  that  has  precisely  h  characteristic  values  of  maximal  modu- 
lus r: 

{0  =  (po<(pi<(p2<"'  < (Ph-i  <2  7l). 

Then,  setting  C  =  A  and  y  =  h  in  the  lemma,  we  have,  for  every  k  =  0, 
l,...,/t-l, 

A=e^'P^D,,AD~\  (24) 

where  Djc  is  a  diagonal  matrix  with  Dt=^  E. 

Again,  let  z  be  a  positive  characteristic  vector  of  A  corresponding  to  the 
maximal  characteristic  value  r: 

Az^=rz      {z>o).  (25) 

Then  setting 

y=D,z      {y-^=z>o),  (26) 

we  find  from  (25)  and  (26)  : 

Ay=hy       a*=re»>*;     k=0,l,...,h-l),  (27) 

The  last  equation  shows  that  the  vectors  y,y, . . .,  y  defined  in  (26)  are 
characteristic  vectors  of  A  for  the  characteristic  values  Xq,  Ai,  .  .  .  ,  Xh-i. 

From  (24)  it  follows  not  only  that  Xo  =  r,  but  also  that  each  character- 
istic value  Ai,  .  .  .  ,  Xn-i  of  A  is  simple.    Therefore  the  characteristic  vectors 

y  and  hence  the  matrices  D^  (/c  =  0,  1,  .  .  . ,  /i  —  1)  are  determined  to  within 
scalar  factors.  To  define  the  matrices  Do,  Di,  .  .  . ,  Dn-\  uniquely  we  shall 
choose  their  first  diagonal  element  to  be  1.    Then  Dq  =  E  and  y  =  z^  o. 
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Furthermore,  from  (24)  it  follows  that 

A=e' (^/ ± ^*) D^Di^ ^ADi" ^Dy''  {j,  k=^0,  I,  . . .,  h-l). 
Hence  we  deduce  similarly  that  the  vector 

is  a  characteristic  vector  of  A  corresponding  to  the  characteristic  value 

Therefore  e*(''yi»'*>  coincides  with  one  of  the  numbers  e*''^  and  the  matrix 
D^Dii"^^  with   the   corresponding   matrix   Dr,   that  is,   we   have,  for  some 

g*  i<pj+<Pk)  =  e*>i,        e«  (jpj—^k)  =  ^ivi^ 

Thus :  The  numbers  €*''»,  e*''s  .  .  .  ,  e^^'A-i  and  the  corresponding  diagonal 
matrices  Do,  Di,  .  .  . ,  Dji^i  form  two  isomorphic  midtiplicative  ahelian 
groups. 

In  every  finite  group  consisting  of  h  distinct  elements  the  h-th  power 
of  every  element  is  equal  to  the  unit  element  of  the  group.  Therefore 
€*''»,  e*''s  .  . .,  e*''*-i  are  h-th  roots  of  unity.  Since  there  are  h  such  roots  of 
unity  and  (pQ=  0  < 9?]  < 9?a  <  •*•  <9'a-i<2^  5 

%  =  ^        {k  =  0,l,2,...,h-l) 

and 

.271 

ei'Fk-^  gi       (e  =et>x  ^^'  h  .     j^^q,  1,  . . .,  h  —  l),  (28) 

A^^re*     (A;  =  0,  1,  ...,  A-1).  (29) 

The  numbers  Xo,  Ai,  .  .  .  ,  A;;_i  form  a  complete  system  of  roots  of  (4). 
In  accordance  with  (28),  we  have:^^ 

i)^  =  7)*       (2)  =  Z)i;  A;=0,  1,  ...,^-1).  (30) 

The  equation  (24)  now  gives  us  (for  A:  =  l)  : 

.2« 

A  =  e^DAD-^.  (31) 


^■*  Here  Ave  use  the  isomorphism  of  the  multiplicative  groups    e*>o,  e*9'i, . . .  ,e*'9'ft-i  and 
Do,  Z)i,  .  .  . ,  Dh~i. 
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pi 
Hence  it  follows  that  the  matrix  A  on  multiplication  by  e'  *  goes  over  into  a 
similar  matrix  and,  therefore,  that  the  whole  system  of  n  characteristic 

271 

values  of  A  on  multiplication  hy  e*  a  goes  over  into  itself. '^^ 
Further, 

so  that  all  the  diagonal  elements  of  D  are  h-ih.  roots  of  unity.  By  a  permuta- 
tion of  A  (and  similarly  of  D)  we  can  arrange  that  D  be  of  the  following 
quasi-diagonal  form : 

D  =  {  rjoEo,  r],E„  . . .,  r?,_iJS;,_i },  (32) 

where  Eo,  Ei,  . . . ,  Es-i  are  unit  matrices  and 

rjj,=  e'yp,  y^j,=  n^j^ 

{up  is  an  integer  ;p  =  0,  1,  ...,s  —  l;0<Wo<...<  ns-i  <  h). 

Obviously  s  ^  h. 

Writing  A  in  block  form  (in  accordance  with  (32)) 

-^11      -^12    •  •  •  ^15' 

4=1^"     ^''■■■/^^'\,  (33) 

'^■'^jri        ■^*2     •  •  •    ^ssJ 

we  replace  (31)  by  the  system  of  equations 

^'^P^'^'l^i^r,        Ug=l,2,  ...,s;e  =  e'^)  (3^) 

Hence  for  every  p  and  q  either  ^^^  =  e  or  Apn  =  0. 

Vp—^ 

Let  us  take  p  =  l.    Since  the  matrices  A12,  A^s,  .  .  . ,  Au  cannot  vanish 

simultaneously,  one  of  the  numbers  —,—,...,  -^^  (i^o  =  1)  must  be  equal 

to  e.    This  is  only  possible  for  ni  =  1.     Then-^  =  e  and  An  =  JL13  =  .  .  .= 

Ais  =  0.  Setting  p  =  2  in  ( 34 ) ,  we  find  similarly  that  ti2  =  2  and  that  A21  = 
A22  —  A.24  =  .  .  .  —  A.2S  ==  0,  etc.    Finally,  we  obtain 


^^  The  number  ^  is  the  largest  integer  having  these  properties,  because  A  has  precisely 
li  characteristic  values  of  maximal  modulus  r.  Moreover,  it  follows  from  (31)  that  all 
the  characteristic  values  of  the  matrix  fall  into  systems  (with  h  numbers  in  each)  of  the 
form  /<Q,  fiffif  '  •  '  f  IJLqE^ — ^  and  that  within  each  such  system  to  any  two  characteristic 
values  there  correspond  elementary  divisors  of  equal  degree.  One  such  system  is  formed 
by  the  roots  of  the  equation  (4)  "ko,  Xi,  .  .  . ,  hi-x. 
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0     Ai2  0     ...  0 


0     0     A..  ...  0 


23 


0      0     0    ...  A,^ 

Aji  As2  Ag^  . . .  Agg 


l,s 


Here  ni  =  l,  n2  =  2,  ...,  ?is_i  =  s  —  1.     But  then  for  p  =  s  on  the  right- 
hand  sides  of  (34)  we  have  the  factors 


Vs-l 


^'  (g=l,2,  ...,  s). 


.2ff 

One  of  these  numbers  must  be  equal  to  e  =  e*  a  .    This  is  only  possible  when 
s  =  h  and  q  =  l;  consequently,  As2  =  .  .  .  =  Agg  =  0. 
Thus, 

D  =  [E^,  8E„  e^E^,  ...,  ^^Ej,_^) , 

and  the  matrix  A  has  the  form  ( 5 ) . 

Frobenius'   theorem  is  now  completely  proved. 

5.    We  now  make  a  few  general  comments  on  Frobenius '  theorem. 

Remark  1.  In  the  proof  of  Frobenius'  theorem  we  have  established 
incidentally  that  for  an  irreducible  matrix  A^O  with  the  maximal  charac- 
teristic value  r  the  adjoint  matrix  B{X)  is  positive  for  A  =  r : 

B{r)>0,  (35) 

i.e., 

A.W>0         (i,h  =  l,2,  ...,n\  (35') 

where  Bijc{r)  is  the  algebraic  complement  of  the  element  vSm  —  Oj^i  in  the 
determinant  |  rE  —  A  | . 

Let  us  now  consider  the  reduced  adjoint  matrix  (see  Vol.  I,  Chapter  IV, 
§6) 

B{X) 


CW 


Bn-l  (A)  ' 


where  D„-i(X)  is  the  greatest  common  divisor  of  all  the  polynomials  Bik(X) 
{i,k  =  l,2,  .  .  .,n).  It  follows  from  (35')  that  Z)„_i(r)  7^0.  All  the  roots 
of  Dn-ii^)  are  characteristic  values^^  distinct  from  r.     Therefore  all  the 


^^  Dn-i(X)  is  a  divisor  of  the  characteristic  polynomial  DnCk)  =  |  kE  —  A 
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roots  of  Dn-i(^)  either  are  complex  or  are  real  and  less  than  r.     Hence 
Dn-i{r)  >  0  and  this,  in  conjunction  with  (35),  yields :^^ 


C{r) 


B{r) 


Dn-i  (r) 


>0 


(36) 


Remark  2.    The  inequality  (35')  enables  us  to  determine  bounds  for  the 
maximal  characteristic  value  r. 
We  introduce  the  notation 

n 

8i  =  ^  Qijg      {i  =  1,2,...,  n),      5  —  min  5^,      S  =  max  Sf. 


Then :  For  an  irreducible  matrix  A^  0 

s<:r^S, 


(37) 


and  the  equality  sign  on  the  left  or  the  right  of  r  holds  for  s  =  S  only;  i.e., 
holds  only  when  all  the  ^row-sums'  Si,  S2,  .  .  . ,  Sn  are  equal.'^^ 

For  if  we  add  to  the  last  column  of  the  characteristic  determinant 


A(r)  = 


r  — a,i       —  ai2 


—  a 


*nl 


—  a. 


■n2 


r  —  a^ 


all  the  preceding  columns  and  then  expand  the  determinant  with  respect  to 
the  elements  of  the  last  column,  we  obtain : 


A=l 


Hence  (37)  follows  by  (35'). 

Remark  3.  An  irreducible  matrix  A^  0  cannot  have  two  linearly  inde- 
pendent non-negative  characteristic  vectors.  For  suppose  that,  apart  from 
the  positive  characteristic  vector  2;  >  0  corresponding  to  the  maximal  charac- 
teristic value  r,  the  matrix  A  has  another  characteristic  vector  y  ^  0  (lin- 
early independent  of  z)  for  the  characteristic  value  a: 


1^  In  the  following  section  it  will  be  shown  for  an  irreducible  matrix  B(X)  >  0,  that 
C(>.)  >  0  for  every  real  l^r. 

1®  Narrower  bounds  for  r  than  (s,  S)  are  established  in  the  papers  [256],  [295]  and 
[119,  IV]. 
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Since  r  is  a  simple  root  of  the  characteristic  equation  \XE  —  A  |  =  0, 

We  denote  by  u  the  positive  characteristic  vector  of  the  transposed  matrix 
A7  for  A  =  r : 

A^u  =ru     (u>  o) . 
Then^» 

r  {y,  u)  =  (2/,  A^u)  =  {Ay,u)  =  a  (y,  u) ; 

hence,  as  a  7^  r, 

{yyu)  =  o, 

and  this  is  impossible  for  W^  o,y^o,y  ^0. 

Remark  4.  In  the  proof  of  Frobenius'  Theorem  we  have  established  the 
following  characterization  of  the  maximal  characteristic  value  r  of  an  irre- 
ducible matrix  A^O : 

r  =  maxr^, 

where  r^.  is  the  largest  number  q  for  which  qx  ^  Ax.    In  other  words,  since 

r^  =  min  ^— ^ ,  we  have 

'^*^"    ""'  .        {Ax), 

r  =  max    mm     ^^ — -. 

Similarly,  we  can  define  for  every  vector  x'^o  {x^  0)  a  number  r^  as  the 
least  number  o-  for  which 

Gx  ^  Ax  ; 

i.e.,  we  set  ,  ^  ^ 

{Ax)i 
f=   max   ^^ — -, 

If  for  some  i  we  have  here  Xi  =  0,  (Ax)i^O,  then  we  shall  take  r^  =  +co. 

As  in  the  case  of  the  function  r^,  it  turns  out  here  that  the  function  r^ 
assumes  a  least  value  r  for  some  vector  v  >  a. 

Let  us  show  that  the  number  r  defined  by 

r  =  niinr*  =  min     max   - — -  (38) 


19  If  2/  =  (1/1,  2/2,  ... ,  2/n)  and  w  =  (ui,  U2,  . . . ,  Wn),  then  we  mean  by  (y,  u)  the  *  scalar 
n 
product'  y^u^  2^yiUi.  Then  (1/,  ^"""w)  =  2/"^^^w  and   (^2/,  w)  =  (^3/)"^^  =  i/'^^'''w. 
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coincides  with  r  and  that  the  vector  v'^o  {v  y^  o)  for  which  this  minimum 
is  assumed  is  a  characteristic  vector  of  A  for  X  =  r. 
For, 

rv  —  Av^o     (v^o,    Vy^o). 

Suppose  now  that  the  sign  ^  cannot  be  replaced  by  the  equality  sign.    Then 
by  Lemma  1 

{E  +  ^)"-^  (rv  —  Av)  >  0,     (E  +  A)"*-^  v  >  o.  (39) 

Setting 

u={E  +  A}''-'^v>o, 

we  have 

ru>  Au 

and  so  for  sufficiently  small  e  >  0 

{r  —  e)u>Au     (w>o), 

which  contradicts  the  definition  of  r.    Thus 

Av=  rv. 

But  then  ^ 

u  =  {E-\-  A)"-^v  =  (1  +  r)»-iv. 

Therefore  u^o  implies  that  t;  >  o. 
Hence,  by  the  Remark  3, 

r  =  r. 
Thus  we  have  for  r  the  double  characterization : 


(Ax)i  .  {Az)i 

r  =  max     mm     ^ — -  =  mm     max     — — 


(40) 


Moreover  we  have  shown  that  max  or  min  is  only  assumed  for  a  positive 

(X  ^  0)         (x^  o) 

characteristic  vector  for  X  =  r. 

From  this  characterization  of  r  we  obtain  the  inequality^° 

xnin   ^'^r^max  ^*     {x^o,  Xy^o).  (41) 

Remark  5.     Since  in  (40)  max  and  min  are  only  assumed  for  a  posi- 
es ^  O)  («  ^  0) 

tive  characteristic  vector  of  the  irreducible  matrix  A^O,  the  inequalities 


20  See  [128]  and  also  [17],  p.  325  ff. 
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rz  ^  Az,  2:  ^  0,  Zy^o 
or 

rz  ^  Az,  z'^o,  z^o 
always  imply  that 

Az  =  rz,  0  >  0. 

§  3.    Reducible  Matrices 

1.  The  spectral  properties  of  irreducible  non-negative  matrices  that  were 
established  in  the  preceding  section  are  not  preserved  when  we  go  over  to 
reducible  matrices.  However,  since  every  non-negative  matrix  A^O  can 
be  represented  as  the  limit  of  a  sequence  of  irreducible  positive  matrices  Am 

^=lim^^     (^^>0,  m  =  l,2,  ...),  (42) 

some  of  the  spectral  properties  of  irreducible  matrices  hold  in  a  weaker  form 
for  reducible  matrices. 

For  an  arbitrary  non-negative  matrix  A=  11  tti^  ||i  we  can  prove  the 
following  theorem : 

Theorem  3:  A  non-negative  matrix  A=  11  aijc  ||7  always  has  a  non- 
negative  characteristic  value  r  such  that  the  moduli  of  all  the  characteristic 
values  of  A  do  not  exceed  r.  To  this  ^maximal'  characteristic  value  r  there 
corresponds  a  non-negative  characteristic  vector 

Ay  =  ry  {y^o,  yy^o). 

The  adjoint  matrix  B(X)  =  11  Bud^)  ||i  =  (XE  —  A)-'^A(X)  satisfies  the 
inequalities 

B(X)^0,    -IjB(X)^0     for    A^r.  (43) 

Proof.  Let  A  be  represented  as  in  (42).  We  denote  by  r^^^  and  y^**^^ 
the  maximal  characteristic  value  of  the  positive  matrix  Am  and  the  corre- 
sponding normalized^^  positive  characteristic  vector : 

^^2/^'«>  =  r('»>y<'»>     ((2/<"'>,2/^'">)  =  l,  y("»)>o;  w=l,  2,  ...)•  (44) 

Then  it  follows  from  (42)  that  the  limit 

lim  r^*")  =  r 


21  By  a  normalized  vector  we  mean  a  column  y  =  (yi,  2/2,  . . . ,  2/n)  for  which  (y,y)  ^ 

i:  y]  =  i. 
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exists,  where  r  is  a  characteristic  value  of  A.  From  the  fact  that  r^^^  >  0 
and  r^*")  >  |  Ao^*"^  |,  where  Ao^*"^  is  an  arbitrary  characteristic  value  of  Am 
(m  =  1,  2,  .  .  .),  we  obtain  by  proceeding  to  the  limit : 

where  Ao  is  an  arbitrary  characteristic  value  of  A.  This  passage  to  the  limit 
gives  us  in  place  of  (35) 

B{r)^0.  (45) 

Furthermore,  from  the  sequence  of  normalized  characteristic  vectors  y^^^ 
(m  =  l,  2,  .  . .)  we  can  select  a  subsequence  y^"^"*  (p  —  1,  2,  .  .  .)  that  con- 
verges to  some  normalized  (and  therefore  non-zero)  vector  y.  When  we  go 
to  the  limit  on  both  sides  of  (44)  by  giving  to  m  the  values  mp  {p  =  l,2,  .  .  .) 
successively,  we  obtain: 

Ay=ry     {y>o,yy^o). 

The  inequalities  (43)  will  be  established  by  induction  on  the  order  n. 
For  w  =  1,  they  are  obvious. ^^  Let  us  establish  them  for  a  matrix  A  =  ||  Oijc  11 1 
of  order  n  on  the  assumption  that  they  are  true  for  matrices  of  order  less 
than  n. 

Expanding  the  characteristic  determinant  A(X)  =  \  XE  —  A  |  with  re- 
spect to  the  elements  of  the  last  row  and  the  last  column,  we  obtain : 

n— 1 

A  (A)  =  (A-0^n.(^)-  2!  4?W«i«««fc .  (46) 

Here  5„„  {X)  =  \  Xd^^  —  a^jj.  |  Jf"^  is  the  characteristic  determinant  of  a  'trun- 
cated' non-negative  matrix  of  order  n  —  1,  and  B^S{^)  is  the  algebraic 
complement  of  the  element  Xdi^  —  (kk  in  -B„„(>^)  {i,k  =  \,2,  .  .  .  ,n  —  1) .  The 
maximal  non-negative  root  of  BnnW  will  be  denoted  by  r„.  Then  setting 
A  =  r^  in  (46)  and  observing  that  by  the  induction  hypothesis 

^uiO'^O         (i,A;=l,2,...,7i-l), 

we  obtain  from  (46)  : 

^(r„)^0. 

On  the  other  hand  Zl(>l)  =  A"  +  .  .  .  ,  so  that  zl(  +  oo)  =  +  oo.  Therefore 
nn  either  is  a  root  oi  A{X)  or  is  less  than  some  real  root  oi  A{X).  In  both 
cases. 


22  For  since  B{\)  =  (XE  —  A) -""J (l) ,  we  have  B(X)  =^,  ^B(X)  =  0  for  7i  =  l. 


68  XIII.    Matrices  with  Non-Negative  Elements 

Since  every  principal  minor  Bjj{X)  of  order  n  —  1  can  be  brought  into 
the  position  of  BnnW  by  a  permutation  of  A,  v^e  have 

r^^r         0':=  1,2,...,  71),  (47) 

where  Vj  denotes  the  maximal  root  of  the  polynomial  Bjj{X)  (j  =  1,  2, . . . ,  ti)  . 
Furthermore,  BiuW  may  be  represented  as  a  minor  of  order  n  —  1  of 
the  characteristic  matrix  XE  —  A,  multiplied  by  ( —  1)*+^.    When  we  differ- 
entiate this  determinant  with  respect  to  X,  we  obtain : 

-^  Bi,  W  =  ^  B^  {X)         (i,  A;  =  1,  2,  . . .,  71  - 1),  (48) 

where  B^^  W  =  \\  Bit  \\  {iy^  j,  ky^  j ;  j  =  l,  2,  .  .  . ,  n)  is  the  adjoint  matrix 
of  the  matrix  ||  aik  11  {i,k  =  l,  2,  .  .  .  ,  j  —  1,  j  +  1,  .  .  . ,  n)  of  order  n  —  1 . 
But,  by  the  induction  hypothesis, 

50-)(A)^0     for     X^rj     U  =  l,2,...,n); 

and  so,  by  (47)  and  (48), 

'^^B{X)^0     for     X^r.  (49) 

From  (45)  and  (49)  it  follows  that 

B{X)-^0     for     X^r. 

The  proof  of  the  theorem  is  now  complete. 

Note.  In  the  passage  to  the  limit  (42)  the  inequalities  (37)  are  pre- 
served. They  hold,  therefore,  for  an  arbitrary  non-negative  matrix.  How- 
ever, the  conditions  under  which  the  equality  sign  holds  in  (37)  are  not 
valid  for  a  reducible  matrix. 

2.    A  number  of  important  propositions  follow  from  Theorem  3  : 

1.  If  A=  II  Oiic  II 1  is  a  non-negative  matrix  with  maximal  characteristic 
value  r  and  C(X)  is  its  reduced  adjoint  matrix,  then 

CiX)^0     for     X^r.  (50) 


For 
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^^^^-^5Sa)' 
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(51) 


where  Z)„_i(A)   is  the  greatest  common  divisor  of  the  elements  of  B(A). 
Since  Z)„_i(A)  divides  the  characteristic  polynomial  A{X)  and  Z>n-i(A)  = 

x--^  +  . . . , 

Dn-iW  >0     for     X>r.  (52) 

Now  (43),  (51),  and  (52)  imply  (50). 

2.    If  A  ^  0  is  an  irreducible  matrix  with  maximal  characteristic  value 
r,  then 

B(X)  >0,  CW  >0     for    X^r.  (53) 


Indeed,  by  (35)  5(r)  >  0.    But  also  (see  (43) )  —BW  ^  0  for  A  ^  r. 
Therefore 


BW>0     for     X^r. 


(54) 


The  other  of  the  inequalities  (53)  follows  from  (51),  (52),  and  (54), 

3.  If  A^O  is  an  irreducible  matrix  with  maximal  characteristic  value 
r,  then 

{XE~A)-^>0     for    X>r.  (55) 

This  inequality  follows  from  the  formula 

since  B{X)  >  0  and  A{X)  >  0  f or  A  >  r. 

4.  The  maximal  characteristic  value  r'  of  every  principal  minor^^  {of 
order  less  than  n)  of  a  non-negative  matrix  A=\\  a^  lii  does  not  exceed 
the  maximal  characteristic  value  r  of  A: 


r^  ^r. 


(56) 


If  A  is  irreducible,  then  the  equality  sign  in  (56)  cannot  occur. 
If  A  is  reducible,  then  the  equality  sign  in  (56)  holds  for  at  least  one 
principal  minor. 


23  We  mean  here  by  a  principal  minor  the  matrix  formed  from  the  elements  of  a  prin- 
cipal minor. 
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For  the  inequality  (56)  is  true  for  every  principal  minor  of  order  n  —  1 
(see  (47)).  If  A  is  irreducible,  then  by  (35')  Bjj{r)  >  0  ij  =  l,2,  .  .  . ,  n) 
and  therefore  r^  ^r. 

By  descent  from  n  —  1  to  n  —  2,  from  n  —  2  to  n  —  3,  etc.,  we  show 
the  truth  of  (56)  for  the  principal  minors  of  every  order. 

If  A  is  a  reducible  matrix,  then  by  means  of  a  permutation  it  can  be 
put  into  the  form 


=(:  3- 


Then  r  must  be  a  characteristic  value  of  one  of  the  two  principal  minors  B 
and  D.    This  proves  Proposition  4. 
From  4.  we  deduce : 

5.    If  A^  0  and  if  in  the  characteristic  determinant 


A(r)  = 


r  —  a 


11 


ai2 


'In 


—  a^i  r  — ^22  •  •  •     — «2»i 


—  «nl       — «n2    •   •  •     r—a^ 


any  principal  minor  vanishes  {A  is  reducible!),  then  every  ^augmented' 
principal  minor  also  vanishes;  in  particular,  so  does  one  of  the  principal 
minors  of  order  n  —  1 

-Sii(A),  B22(A),   .  .  .  ,  Bnn(^)- 

From  4.  and  5.  we  deduce : 

6.  A  matrix  A^  0  is  reducible  if  and  only  if  in  one  of  the  relations 

5,,(r)^0        {i  =  l,2,  ...,n) 

the  equality  sign  holds. 
From  4.  we  also  deduce : 

7.  If  r  is  the  maximal  characteristic  value  of  a  matrix  A^O,  then  for 
every  X^r  all  the  principal  minors  of  the  characteristic  matrix  A\^XE  —  A 
are  positive : 

AA}  *2  •••  M^^     ^^^^.  l^*i<i2<...<*p^n;^  =  l,2,...,?i).    (57) 

\*1    *2    •  •  •    V 


It  is  easy  to  see  that,  conversely,  (57)  implies  that  A  >  r.    For 
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n 

A{X  +  ix)  =  \{^X  +  ^)E-A\  =  \Ax  +  [JiE\  =  2^  Sj.iJiJ'-^ 

where  Su  is  the  sum  of  all  the  principal  minors  of  order  k  of  the  character- 
istic matrix  A\^XE  —  A  (/c  =  1,  2,  .  .  .  ,  n)}^  Therefore,  if  for  some  real 
X  all  the  principal  minors  of  A\  are  positive,  then  for  ^  ^  0 

i.e.,  no  number  greater  than  /I  is  a  characteristic  value  of  A.    Therefore 

r  <  A. 

Thus,  (57)  is  a  necessary  and  sufficient  condition  for  X  to  be  an  upper 
bound  for  the  moduli  of  the  characteristic  values  of  A}^  However,  the 
inequalities  (57)  are  not  all  independent. 

The  matrix  XE  —  A  is  a  matrix  with  non-positive  elements  outside  the 
main  diagonal.^^  D.  M.  Kotelyanskii  has  proved  that  for  such  matrices,  just 
as  for  symmetric  matrices,  all  the  principal  minors  are  positive,  provided 
the  successive  principal  minors  are  positive.  ^^ 

Lemma  3  (Kotelyanskii)  :  If  in  a  real  matrix  G=  11  ^ifc  p  all  the  non- 
diagonal  elements  are  negative  or  zero 

guc^O         {iy^k',i,k  =  l,2,...,n)  (58) 

and  the  successive  principal  minors  are  positive 

then  all  the  principal  minors  are  positive  : 

^/h  *2  •••  *p\      ^         (l^i\<t2<...<ip^n;  2?  =1,2,  ...,%). 
\h  *2  •  •  •  V 


24  See  Vol.  I,  p.  70. 

25  See  [344]. 

26  It  is  easy  to  see  that,  conversely,  every  matrix  with  negative  or  zero  non-diagonal 
elements  can  be  represented  in  the  form  XE  —  A,  where  ^  is  a  non-negative  matrix  and  X 
is  a  real  number. 

27  See  [215].  This  paper  contains  a  number  of  results  about  matrices  in  which  all  the 
non-diagonal  elements  are  of  like  sign. 
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Proof.    We  shall  prove  the  lemma  by  induction  on  the  order  n  of  the 
matrix.    For  w  =  2  the  lemma  holds,  since  it  follows  from 

9^12  ^0     9^21  ^  0,     grii  >  0,     ^119^22  —  9^129^21  >  0 

that  ^22  >  0.  Let  us  assume  now  that  the  lemma  is  true  for  matrices  of 
order  less  than  n;  we  shall  then  prove  it  for  (r=  ll  QiTc  11 1 .  We  consider  the 
bordered  determinants 

From  (58)  and  (59)  it  follows  that 

tiic^O         {iy^k;i,k  =  2,  ...,n). 

On  the  other  hand,  by  applying  Sylvester's  identity  (Vol.  I,  Chapter  II, 
(30),  p.  33)  to  the  matrix  T=  11  Uk  IIS  ,  we  obtain: 


\h  H  ...  V 


^^''^  \l     i,     i,     ...     ij  \       p=l,2,    ...,71-1         / 

Hence  it  follows  by  (59)  that  the  successive  principal  minors  of  the  matrix 
T=  II  tijc  p   are  positive: 

/2\  (2    3\  (2  3   ...  n\ 

Thus,  the  matrix  ^=  ||  tijc  p  of  order  n  —  1  satisfies  the  condition  of 
the  lemma.  Therefore  by  the  induction  hypothesis  all  the  principal  minors 
are  positive : 

y/h  ^2  ...  M^ ^     {2^ij^<i^<"'<ij,^n;p  =  l,2,..,,n-l). 
\h  *2  .  • .  *p/ 

But  then  it  follows  from  (60)  that  all  the  principal  minors  of  G  containing 
the  first  row  are  positive : 


«(;::;::::::)>»  -'■ 


<i2<"'  <L^n;p  =  1,2,  ...,n  — I),     (61) 
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Let  us  choose  fixed  indices  ii,  i2,  .  .  . ,  in -2  (where  1  <  ti  <  12  <  ■  •  •  < 

in-2  ^  ^)  and  form  the  matrix  of  order  n  —  1 : 

llsr^^ll         (a,)3  =  l,ti,i2,...,  i„_2).  (62) 

The  successive  principal  minors  of  this  matrix  are  positive,  by  (61)  : 

,,,>o.G(;M>o.....«r>'— H>o; 

\1    HI  \1    li  l2   ...   t„_2/ 

and  the  non-diagonal  elements  are  non-positive : 

ga^^O        {cty^P;  a, /3=l,ii,  1*2,  ...,  K-2)' 

But  the  order  of  (62)  is  n  —  1.  Therefore,  by  the  induction  hypothesis,  all 
the  principal  minors  of  this  matrix  are  positive ;  in  particular, 

g(''''"''']>0  (63) 

{2^ii<i^<  '"  <ip^n',  p=l,2,  ...,  w  — 2). 

Thus,  all  the  minors  of  G  of  order  not  exceeding  n  —  2  are  positive. 

Since  by  (63)  5^22  >  0,  we  may  now  consider  the  determinants  of  order 
two  bordering  the  element  g22  (and  not  gn  as  before)  : 


/2  i 
'^         \2  k 


\         (i,  A;--l,  3,  ...,n). 


By  operating  with  the  matrix  T*  =  ||  tf^  \\,  as  we  have  done  above  with  T, 
we  obtain  inequalities  analogous  to  (61)  : 

2  h  . . .  ip 


\2h...  ij  (64) 

{ii<i2<  •••  <ip;     h,  .-.,  ip=l,^,  ...,  w;     p=l,2,  ...,  n  —  l). 

Since  every  principal  minor  oi  G  ^=  \\  Qijc  11 1  contains  either  the  first  or 
the  second  row  or  is  of  order  not  exceeding  n  —  2,  it  follows  from  ( 61 ) ,  ( 63 ) , 
and  (M)  that  all  the  principal  minors  of  A  are  positive.  This  completes 
the  proof  of  the  lemma. 

This  lemma  allows  us  to  retain  only  the  successive  principal  minors  in 
the  condition  (57)  and  to  formulate  the  following  theorem: 


28  See  [344]  and  [215].    Since  C  =  A  —  IE  and  A  ^  0,  In  is  real  (this  follows  from 
Xn-\-'k=-r)   and  the  corresponding  characteristic  vector  of  C  is  non-negative:  C^  ^  Xn^/ 

(2/^0,  1/v^o). 
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Theorem  4 :    A  real  number  X  is  greater  than  the  maximal  characteristic 
value  r  of  the  matrix  A  ^  \\  aik  \\i  ^  0 

r  <  A 

if  and  only  if  for  this  value  l  all  the  successive  principal  minors  of  the  char- 
acteristic matrix  Ax^kE  —  A  are  positive : 


A  —  a^i       —  a 


1-21 


12 


''22 


>0. 


«11 


[-21 


^12 
^22 


—  f^h 


'2n 


—  a„i      — a„2  ...  A  — a„ 


>0.       (65) 


Let  us  consider  one  application  of  Theorem  4.  Suppose  that  in  the  matrix 
C=  II  Cik  II 1  all  the  non-diagonal  elements  are  non-negative.  Then  for  some 
^  >  0  v^e  have  A  =  C  +  XE  ^0.  We  arrange  the  characteristic  values  Xi 
(i=l,  2,  .  .  .  ,  n)  of  C  with  their  real  parts  in  ascending  order: 

Re  Ai  ^  Re  ^2  ^  . . .  ^  Re  A„. 

We  denote  by  r  the  maximal  characteristic  value  of  A.  Since  the  charac- 
teristic values  of  A  are  the  sums  Xi  +  X  {i  =  l,2,...,n),we  have 

Xn  +  X  =  r. 

In  this  case  the  inequality  r  <  A  holds  for  A„  <  0  only,  and  signifies  that  all 
the  characteristic  values  of  C  have  negative  real  parts.  When  we  write  down 
the  inequality  ( 65 )  for  the  matrix  —  C  ^=  XE  —  A,  we  obtain  the  following 
theorem : 

Theorem  5 :  The  real  parts  of  all  the  characteristic  values  of  a  real 
matrix  C=  \\  Ciu  ||i  with  non-negative  non-diagonal  elements 

Cifc  ^  0  {ij^k;  i,k  =  1,2,  ...,  n) 

are  negative  if  and  only  if 


Cn<0, 


Cll 


12 


C21       C22 


>0 


(-1)' 


'11 


a2 


^21     ^22 


Cl„ 
C-Zn 


'nl 


^n2 


...  C. 


>0 


(66) 


§  4.    The  Normal  Form  of  a  Reducible  Matrix 

By  means  of  a 


1.    We  consider  an  arbitrary  reducible  matrix  A 
permutation  we  can  put  it  into  the  form 


(kkWi 
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where  B  and  D  are  square  matrices. 

If  one  of  the  matrices  B  or  D  is  reducible,  then  it  can  also  be  represented 
in  a  form  similar  to  (67),  so  that  A  then  assumes  the  form 


If  one  of  the  matrices  K,  L,  M  is  reducible,  then  the  process  can  be  con- 
tinued. Finally,  by  a  suitable  permutation  we  can  reduce  A  to  triangular 
block  form 


^^11      0    ...   0 

■"21     -^22   •  •  •     ^ 


iAgi     As2  '  •  •  Aggj 


(68) 


where  the  diagonal  blocks  are  square  irreducible  matrices. 
A  diagonal  block  Au  (1  ^  ^  ^  s)  is  called  isolated  if 

A,,  =  0     {k=l,2,...,i-l,i  +  l,...,s). 

By  a  permutation  of  the  blocks  (see  p.  50)  in  (68)  we  can  put  all  the 
isolated  blocks  in  the  first  places  along  the  main  diagonal,  so  that  A  then 
assumes  the  form 

...  0  0.. 

...  0  0.. 


A=\      O  0  ...  A^        0  ...   0      I:  (69) 

•  •   -^g+l,  ff    -^(7+1        •  • 


•  •  •    A^g  '^s,g+l    •  • 

here  Ai,  A2,  . . . ,  Ag  are  irreducible  matrices,  and  in  each  row 
Afv  ^/2»  •  •  •.  ^/,/-i       if  =9+  1,  . . .,  «) 

at  least  one  matrix  is  different  from  zero. 

We  shall  call  the  matrix  (69)  the  normal  form  of  the  reducible  matrix  A. 
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Let  us  show  that  the  normal  form  of  a  matrix  A  is  uniquely  determined 
to  within  a  permutation  of  the  blocks  ayid  permutations  within  the  diagonal 
blocks  (the  same  for  rows  and  columns). ^^  For  this  purpose  we  consider 
the  operator  A  corresponding  to  A  in  an  7i-dimensional  vector  space  R.  To 
the  representation  of  A  in  the  form  (69)  there  corresponds  a  decomposition 
of  R  into  coordinate  subspaces 

R  =  Ri  +  R2  +  . . .  +  R^  +  R^+i  +  . . .  +  Rs ;  (70) 

here  Rg,  Rs-i  +  Rs,  Rs-2  +  Rs-i  +  Rs,  •  •  •  are  invariant  coordinate  subspaces 
for  A,  and  there  is  no  intermediate  invariant  subspace  between  any  two 
adjacent  ones  in  this  sequence. 

Suppose  then  that  apart  from  the  normal  form  (69)  of  the  given  matrix 
there  is  another  normal  form  corresponding  to  another  decomposition  of  R 
into  coordinate  subspaces : 

R  =  Ri  +  R2  +  . . .  +  R^  +  R^+i  +  ...-{- Rf.  (71) 

The  uniqueness  of  the  normal  form  will  be  proved  if  we  can  show  that  the 

decompositions  (70)  and  (71)  coincide  apart  from  the  order  of  the  terms. 

^^ 
Suppose  that  the  invariant  subspace  Rt  has  coordinate  vectors  in  com- 

/% 
mon  with  Rjc,  but  not  with  Rfc+i,  .  .  .  ,  Rs.     Then  Rt  must  be  entirely  con- 

/\ 
tained  in  R^,  since  otherwise  Rt  would  contain  a  'smaller'  invariant  sub- 

/\  /\ 

space,  the  intersection  of  Rt  with  R^  +  R^+i  +  . .  .  +  Rs.    Moreover,  Rt  must 

coincide  with  R^,  since  otherwise  the  invariant  subspace  Rt  +  Rh+i  + .  .  .  +Rs 
would  be  intermediate  between  R^  +  R^+i  +  .  .  .  +  Rs  and  Rk+i  +  .  .  .  +  Rs- 

Since  Rfc  coincides  with  Rt,  Rk  is  an  invariant  subspace.  Therefore,  without 
infringing  the  normal  form  of  the  matrix,  R^  can  be  put  in  the  place  of  Rs. 

Thus,  we  may  assume  that  in  (70)  and  (71)  Rs  ^  R^ 

/\ 
Let  us  now  consider  the  coordinate  subspace  Rt-i.    Suppose  that  it  has 

coordinate  vectors  in  common  with  Ri  (?  <  s),  but  not  with  Ri+i,  .  . . ,  R^. 

Then   the   invariant   subspace   R^-i  +  R^   must   be   entirely   contained   in 

Rj  +  Rj+i  +  . . .  +  Rs,  since  otherwise  there  would  be  an  invariant  coordinate 

/\  /\  /%  /\ 

subspace  intermediate  between  Rt  and  R^-i  +  R^.     Therefore  R^_iCRi. 

Moreover  Rt-i  ^  Ri,  since  otherwise  Rt-i  +  Ri+i  +  . .  .  +  Rs  would  be  an 

invariant  subspace  intermediate  between  Rj  +  Rj+i  +  . .  .  +  Rs  and  R^+i  + 


29  Without  violating  the  normal  form  we  can  permute  the  first  g  blocks  arbitrarily 
among  each  other.  Moreover,  sometimes  certain  permutations  among  the  last  s  —  g  blocks 
are  possible  with  preservation  of  the  normal  form. 
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^^ 
. . .  +  Rs.    From  Rt-i  ^  Ri  it  follows  that  Ri  +  R.,  is  an  invariant  subspace. 

Therefore  Ri  may  be  put  in  the  place  of  K^-i  and  then  we  have 

Rt-i^Rs-ii   Rf^Ra' 

Continuing  this  process,  we  finally  reach  the  conclusion  that  s  =  t  and 
that  the  decompositions  (70)  and  (71)  coincide  apart  from  the  order  of  the 
terms.  The  corresponding  normal  forms  then  coincide  to  within  a  permuta- 
tion of  the  blocks. 

From  the  uniqueness  of  the  normal  form  it  follows  that  the  numbers 
g  and  s  are  invariants  of  the  non-negative  matrix  A.^^ 

2.  Making  use  of  the  normal  form,  we  shall  now  prove  the  following 
theorem : 

Theorem  6 :  To  the  maximal  characteristic  value  r  of  the  matrix  A^O 
there  belongs  a  positive  characteristic  vector  if  and  only  if  in  the  normal 
form  (69)  (>/  A :  1)  each  of  the  matrices  Ai,  A2,  .  .  . ,  Ag  has  r  as  a  charac- 
teristic value;  and  (in  case  g  <^  s)  2)  none  of  the  matrices  Ag+i,  .  .  . ,  As  has 
this  property. 

Proof.  1.  Let  2;  >  0  be  a  positive  characteristic  vector  belonging  to  the 
maximal  characteristic  value  r.  In  accordance  with  the  dissection  into 
blocks  in  (69)  we  dissect  the  column  z  into  parts  z'^  (k  =  l,2,  .  .  . ,  s).  Then 
the  equation 

Az  =  rz      (z>o)  (72) 

is  replaced  by  two  systems  of  equations 


A,z'  =  rz*         (i=l,2,...,g). 

(72') 

h-i 

2J  Af^z"  +  Afi  =  ra         (/  =  5,  +  1,  . . 

-.«). 

(72") 

From   (72')   it  follows  that  r  is  a  characteristic  value  of  each  of  the 
matrices  Ai,  A2,  .  .  . ,  Ag.    From  (72")  we  find : 

Af^  ^  rz^ ,     Ap¥^  rzJ      (7  =  gr  +  1 ,  . . . ,  s) .  (73) 

We  denote  by  Tj  the  maximal  characteristic  value  oi  A,  {j  =  g  -h  1, .  .  .  ,s). 
Then  (see  (41)  on  p.  65)  we  find  from  (73)  : 

r;^max^-^^'^r      (/=gr-f-l,  ...,  s). 


30  For  an  irreducible  matrix,  g  =zs=l. 
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On  the  other  hand,  the  equation  Vj  =  r  would  contradict  the  second  of  the 
relations  (73)  (see  Note  5  on  p.  65).    Therefore 

r}<r      (/=gr-M,  ...,  5).  (74) 

2.  Suppose  now,  conversely,  that  the  maximal  characteristic  values  of 
the  matrices  Ai  {i  =  l,  2,  . .  .  ,  g)  are  equal  to  r  and  that  (74)  holds  for  the 
matrices  Aj  {j  =  g  +  1,  .  .  . ,  5).  Then  by  replacing  the  required  equation 
(72)  by  the  systems  (72'),  (72")  we  can  define  positive  characteristic  col- 
umns z^  of  the  matrices  Ai  {i  =  l,  2,  .  .  .  ,  g)  hy  means  of  (72').  Next  we 
find  columns  z^  (j  =  g  +  1,  .  .  .  ,  s)  from  (72")  : 

z^={rEj-Aj)-^2;A^,z^      {j  =  g  +  h  ...,5),  (75) 

where  Ej  is  the  unit  matrix  of  the  same  order  as  Aj  (j  =  g  +  1,  .  .  .  ,  s). 
Since  Vj  <C  r  (j  =  g  +  1,  .  .  .  ,  5),  we  have  (see  (55)  on  p.  69) 

{rEi-Aj)-^>0       {j^g+l,  ...,s).  (76) 

Let  us  prove  by  induction  that  the  columns  ;2^+^,  .  .  . ,  z^  defined  by  (75) 
are  positive.  We  shall  show  that  for  every  j  (fi'  +  1  =  i  =  s)  the  fact  that 
z^,  z^,  . . . ,  z^~^  are  positive  implies  that  z^'  >  0.    Indeed,  in  this  case, 

;-l  /-I 

which  in  conjunction  with  (76)  yields,  by  (75)  : 

zJ  >  0. 

Thus,  the  positive  column  z'=  {z^,  . .  . ,  z^)  is  a  characteristic  vector  of  A 
for  the  characteristic  value  r.    This  completes  the  proof  of  the  theorem. 

3.  The  following  theorem  gives  a  characterization  of  a  matrix  A  ^  0 
which  together  with  its  transpose  A7  has  the  property  that  a  positive  char- 
acteristic vector  belongs  to  the  maximal  characteristic  value. 

Theorem  7  :^^  To  the  maximal  characteristic  value  r  of  a  matrix  A^O 
there  belongs  a  positive  characteristic  vector  both  of  A  and  of  A"""  if  and  only 
if  A  can  he  represented  hy  a  permutation  in  quasi-diagonal  form 

A  =  [A^,A^,..,,A,],  (77) 

where  Ai,  A2,  .  .  . ,  As  are  irreducible  matrices  each  of  which  has  r  as  its 
maximal  characteristic  value. 


3^See  [166], 
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Proof.  Suppose  that  A  and  A"^  have  positive  characteristic  vectors  for 
A  =  r.  Then,  by  Theorem  6,  A  is  representable  in  the  normal  form  (69), 
where  A^,  A2, .  .  . ,  Ag  have  r  as  maximal  characteristic  value  and  (for  ^  <  s) 
the  maximal  characteristic  values  of  A^+i,  .  .  . ,  Ag  are  less  than  r.    Then 

\  ...  0  ^;+i.,  ...A 


A'M 


A^   A^ 

0  ^;+i 


0    ...    0    0  ...    AJ 

Let  us  reverse  here  the  order  of  the  blocks  in  this  matrix : 
'AJ  0  0  .  .  .  0 

^L-1    ^J-1         0  ...0 


(78) 


Al 


1,1 


...Al^ 


Since  Al,  ^J_i ,  .  .  .  ,  A^  are  irreducible,  we  obtain  a  normal  form  for  (78) 
by  a  permutation  of  the  blocks,  placing  the  isolated  blocks  first  along  the 
main  diagonal.  One  of  these  isolated  blocks  is  Aj  .  Since  the  normal  form 
of  AJ  must  satisfy  the  conditions  of  the  preceding  theorem,  the  maximal 
characteristic  value  of  A^  must  be  equal  to  r.  This  is  only  possible  when 
gr  =  s.  But  then  the  normal  form  (69)  goes  over  into  (77). 
If,  conversely,  a  representation  (77)  of  A  is  given,  then 


A-={Al 


A^ 


Al). 


(79) 


We  then  deduce  from  (77)  and  (79),  by  the  preceding  theorem,  that  A  and 
A^  have  positive  characteristic  vectors  for  the  maximal  characteristic  value  r. 
This  proves  the  theorem. 

Corollary.  If  the  maximal  characteristic  value  r  of  a  matrix  A^O 
is  simple  and  if  positive  characteristic  vectors  'belong  to  r  both  in  A  and  A7, 
then  A  is  irreducible. 
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Since,  conversely,  every  irreducible  matrix  has  the  properties  of  this 
corollary,  these  properties  provide  a  spectral  characterization  of  an  irre- 
ducible non-negative  matrix. 

§  5.    Primitive  and  Imprimitive  Matrices 

1.    We  begin  with  a  classification  of  irreducible  matrices. 

Definition  3 :  If  an  irreducible  matrix  A^O  has  h  characteristic 
values  Xi,  l^,  .  .  .  ,  l-k  of  maximal  modulus  r  {h'=\h\=^  -  -  •^\  h\='r), 
then  A  is  called  primitive  if  h=^l  and  imprimitive  i/  /i  >  1.  h  is  called  the 
index  of  imprimitivity  of  A. 

The  index  of  imprimitivity  h  is  easily  determined  if  the  coefficients  of 
the  characteristic  equation  of  the  matrix  are  known 

A  (X)^r  +  aj!"^  +  a^l""^  +  •  ••  +  a.^'  =0 

namely :  h  is  the  greatest  common  divisor  of  the  differences 

n  —  n^,  n^  —  n^,  ...,nf_i  —  n^.  (80) 

For  by  Frobenius'  theorem  the  spectrum  of  A  in  the  complex  >l-plane 
goes  over  into  itself  under  a  rotation  through  27i/h  around  the  point  A  =  0. 
Therefore  the  polynomial  A(X)  must  be  obtained  from  some  polynomial 
g{fi)  by  the  formula 

AW=g{X^)X-\ 

Hence  it  follows  that  h  is  a  common  divisor  of  the  differences  (80).  But 
then  h  is  the  greatest  common  divisor  d  of  these  differences,  since  the  spec- 
trum does  not  change  under  a  rotation  by  27i/d,  which  is  impossible  for  h  <  d. 
The  following  theorem  establishes  an  important  property  of  a  primitive 
matrix : 

Theorem  8 :  A  matrix  A^  0  is  primitive  if  and  only  if  some  power  of 
A  is  positive : 

AP>  0        (p^l).  (81) 

Proof.  If  A^  >  0,  then  A  is  irreducible,  since  the  reducibility  of  A 
would  imply  that  of  A^.  Moreover,  for  A  we  have  /i  =  1,  since  otherwise 
the  positive  matrix  A^  would  have  h  (>  1)  characteristic  values 

of  maximal  modulus  r^,  and  this  contradicts  Perron's   theorem. 
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Suppose  now,  conversely,  that  A  is  primitive.     We  apply  the  formula 
(23)  of  Chapter  V  (Vol.  I,  p.  107)  to  AP 


^'".^(-^• 


1) 


(mk-1) 


(82) 


where 


^(A)  =  (;i-Ai)-^(A-A2)--..(A-AJ-^     aj¥=?^ftoT  jy^f) 

k  yj  (A) 

is  the  minimal  polynomial  of  A,  xp{X)  =  o  _  ^Amk  (^  =  1,  2,  .  .  .  ,  s)  and  C{X) 

=  {XE  —  A)-\{X)  is  the  reduced  adjoint  matrix. 
In  this  case,  we  can  set : 


Then  (82)  assumes  the  form 


•  ^  I AJ     and     Wi  =  1 . 


(83) 


V'(0        ^,^(^A:-l)! 


Hence  it  is  easy  to  deduce  by  (83)  that 

i;^  Av_C{r) 


C(A)APl("**-^> 


y,{X) 


(84) 


On  the  other  hand,  C(r)  >  0  (see  (53) )  and  yj'ir)  >  0  by  (83).    There- 


fore 


p-*'00 


and  so  (73)  must  hold  from  some  p  onwards.^^    This  completes  the  proof. 
We  shall  now  prove  the  following  theorem : 

Theorem  9 :  If  A^  0  is  an  irreducible  matrix  and  some  power  A^  of 
A  is  reducible,  then  A^  is  completely  reducible,  i.e.,  A^  can  be  represented 
by  means  of  a  permutation  in  the  form 


A^=[A^,A^,...,A,}. 


(85) 


where  Ai,  A2,  .  .  . ,  Aa  are  irreducible  matrices  having  one  and-  the  same 
maximal  characteristic  value.  Here  d  is  the  greatest  common  divisor  of  q 
and  h,  where  h  is  the  index  of  imprimitivity  of  A. 


32  As  regards  a  lower  bound  for  the  exponent  p  in  (81),  see  [3841. 
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Proof.  Since  A  is  irreducible,  we  know  by  Frobenius'  theorem  that 
positive  characteristic  vectors  belong  to  the  maximal  characteristic  value  r, 
both  in  A  and  in  A7.  But  then  these  positive  vectors  are  also  characteristic 
vectors  of  the  non-negative  matrices  A^  and  {A^Y  for  the  characteristic 
value  X^r^.  Therefore  by  applying  Theorem  7  to  J.«,  we  represent  this 
matrix  (after  a  suitable  permutation)  in  the  form  (85) ,  where  A^,  A2, .  .  .  ,Ad 
are  irreducible  matrices  with  the  same  maximal  characteristic  value  r^. 
But  A  has  h  characteristic  values  of  maximal  modulus  r : 

r,re,  . . .,  re^-'^      \s=e^  /. 
Therefore  A^  also  has  h  characteristic  values  of  maximal  modulus 

among  which  d  are  equal  to  r^.    This  is  only  possible  when  d  is  the  greatest 
common  divisor  of  q  and  h.    This  proves  the  theorem. 
For  h  =  l,  we  obtain : 

Corollary  1 :  A  power  of  a  primitive  matrix  is  irreducible  and  primi- 
tive. 

If  we  set  q^h  in  the  theorem,  then  we  obtain : 

Corollary  2  :  If  A  is  an  imprimitive  matrix  with  index  of  imprimitivity 
h,  then  A^  splits  into  h  primitive  matrices  with  the  same  maximal  charac- 
teristic value. 

§  6.    Stochastic  Matrices 

1.    We  consider  n  possible  states  of  a  certain  system 

Si,S2,...,8n  (86) 

and  a  sequence  of  instants 

U,  ^1,  t2,  .  .  .  . 

Suppose  that  at  each  of  these  instants  the  system  is  in  one  and  only  one 
of  the  states  (86)  and  that  pij  denotes  the  probability  of  finding  the  system 
in  the  state  Sj  at  the  instant  tjc  if  it  is  known  that  at  the  preceding  instant 
tk-i  the  system  is  in  the  state  Si  {i,  j  =  1,  2,  .  .  .  ,  n  ;  k  =  1,  2,  .  .  .) .  We  shall 
assume  that  the  transition  probability  pij  {i,  j^l,2,  .  .  .  ,n)  does  not  depend 
on  the  index  h  (of  the  instant  #&). 

If  the  matrix  of  transition  probabilities  is  given, 
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then  we  say  that  we  have  a  homogeneous  Markov  chain  with  a  finite  number 
of  states.^^    It  is  obvious  that 

n 

Pij^O,    2:pij  =  l        {i,j=l,2,...,n).  (87) 

7=1 

Definition  4:  A  square  matrix  P=  11  pa  11 1  is  called  stochastic  if  P  is 
non-negative  and  if  the  sum  of  the  elements  of  each  row  of  P  is  1,  i.e.,  if  the 
relations  (87)  hold.^^ 

Thus,  for  every  homogeneous  Markov  chain  the  matrix  of  transition 
probabilities  is  stochastic  and,  conversely,  every  stochastic  matrix  can  be 
regarded  as  the  matrix  of  transition  probabilities  of  some  homogeneous 
Markov  chain.  This  is  the  basis  of  the  matrix  method  of  investigating  homo- 
geneous Markov  chains. ^^ 

A  stochastic  matrix  is  a  special  form  of  a  non-negative  matrix.  There- 
fore all  the  concepts  and  propositions  of  the  preceding  sections  are  applicable 
to  it. 

We  mention  some  specific  properties  of  a  stochastic  matrix.  From  the 
definition  of  a  stochastic  matrix  it  follows  that  it  has  the  characteristic  value 
1  with  the  positive  characteristic  vector  2;  =  (1,  1,  .  .  . ,  1),  It  is  easy  to  see 
that,  conversely,  every  matrix  P^  0  having  the  characteristic  vector 
(1,  1,  .  .  . ,  1)  for  the  characteristic  value  1  is  stochastic.  Moreover,  1  is  the 
maximal  characteristic  value  of  a  stochastic  matrix,  since  the  maximal  char- 
acteristic value  is  always  included  between  the  largest  and  the  smallest  of 
the  row  sums^'^  and  in  a  stochastic  matrix  all  the  row  sums  are  1.  Thus, 
we  have  proved  the  proposition : 

1.  A  non-negative  matrix  P^O  is  stochastic  if  and  only  if  it  has  the 
characteristic  vector  (1,  1,  ...,  1)  for  the  characteristic  value  1.  For  a 
stochastic  ynatrix  the  maximal  characteristic  value  is  1. 

Now  let  ^=11  dik  111  be  a  non-negative  matrix  with  a  positive  maximal 
characteristic  value  r  >  0  and  a  corresponding  positive  characteristic  vector 

Z=  (2!i,  2!2,  .  ..  ,  Zn)    >  0 '. 


33  See  [212]  and  [46],  pp.  9-12. 

n 

34  Sometimes  the  additional   condition    ^pg  =^  0  (j^l,  2,  .  .  . ,  n)  is  included  in  the 

i  =1 
definition  of  a  stochastic  matrix.    See  [46],  p.  13. 

35  The  theory  of  homogeneous  Markov  chains  with  a  finite  (and  a  countable)  number 
of  states  was  introduced  by  Kolmogorov  (see  [212]).  The  reader  can  find  an  account 
of  the  later  introduction  and  development  of  the  matrix  method  with  applications  to 
homogeneous  Markov  chains  in  the  memoir  [329]  and  in  the  monograph  [46]  by  V.  I. 
Eomanovskii  (see  also  [4],  Appendix  5). 

36  See  (37)  and  the  note  on  p.  68. 
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(88) 


We  introduce  the  diagonal  matrix  Z  =  {zi,  Z2,  .  .  . ,  Zn)  and  the  matrix 

^=l|Piill? 


Then 


and  by  (88) 


Thus 


r 


Pij=y  ^tVj  ^0       {i,j  =  \,2,..,,n)^ 


2^Vij  =  l        (*  =  1,2,  ...,7i) 


2.  A  non-negative  matrix  A'^0  with  the  maximal  positive  characteristic 
value  r  >  0  and  with  a  corresponding  positive  characteristic  vector 
z=  (zi,Z2, . . .  Zn)  >  o  is  similar  to  the  product  of  r  and  a  stochastic  matrix  :^^ 


A  =  ZrPZ-^       {Z  =[z^,  z^,...,  zj  >  0) . 


(89) 


In  a  preceding  section  we  have  given  (see  Theorem  6,  §  4)  a  characteriza- 
tion of  the  class  of  non-negative  matrices  having  a  positive  characteristic 
vector  for  A  =  r.  The  formula  (89)  establishes  a  close  connection  between 
this  class  and  the  class  of  stochastic  matrices. 

2.    We  shall  now  prove  the  following  theorem : 

Theorem  10 :  To  the  characteristic  value  1  of  a  stochastic  matrix  there 
always  correspond  only  elementary  divisors  of  the  first  degree. 

Proof.    We  apply  the  decomposition  (69),  §4,  to  the  stochastic  matrix 

p=\\Pi^ri- 

A^       0 0 


A, 


0 


where  Ai,  A2, 


0  .    . 

^,0 

.    .    0 

-^g+l,  1 

^g+1^  g  -^17+1 

.    .    0 

An    . 

A., 

.    .    A 

,  As  are 

irreducible  and 

37  Proposition  2.  also  holds  for  r  =  0,  since  A^O,  z^  0  implies  that  A  =  0. 
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^/i  +  ^/2  +  •  •  •  +  ^f,f-i¥^0      (/  =9r  +  1,  . . . ,  5) . 

Here  Ai,  A2,  .  .  . ,  Ag  are  stochastic  matrices,  so  that  each  has  the  simple 
characteristic  value  1.  As  regards  the  remaining  irreducible  matrices 
Ag+i,  ...,  As,  hy  the  Remark  2  on  p.  63  their  maximal  characteristic  values 
are  less  than  1,  since  in  each  of  these  matrices  at  least  one  row  sum  is  less 
than  1.38 

Thus,  the  matrix  P  is  representable  in  the  form 


where  in  Qi  to  the  value  1  there  correspond  elementary  divisors  of  the  first 
degree,  and  where  1  is  not  a  characteristic  value  of  Q2-  The  theorem  now 
follows  immediately  from  the  following  lemma: 

Lemma  4 :    If  a  matrix  A  has  the  form 


(90) 


where  Qi  and  Q2  are  square  matrices,  and  if  the  characteristic  value  Xq  of  A 
is  also  a  characteristic  value  of  Qi,  hut  not  of  Q2, 

1^1-^0^1  =  0,       \Q^-?i^E\y^O, 

then  the  elementary  divisors  of  A  and  Qi  corresponding  to  the  characteristic 
value  Ao  are  the  same. 

Proof.  1.  To  begin  with,  we  consider  the  case  where  Qi  and  Q2  do  not 
have  characteristic  values  in  common.  Let  us  show  that  in  this  case  the 
elementary  divisors  of  Qi  and  Q2  together  form  the  system  of  elementary 
divisors  of  A,  i.e.,  for  some  matrix  T  ( |  T  |  7^  0) 

TAT-^=(^'-     ^].  (91) 

We  shall  look  for  the  matrix  T  in  the  form 


38  These  properties  of  the  matrices  Ai,  .  . . ,  A,  also  follow  from  Theorem  6. 
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(the  dissection  of  T  into  blocks  corresponds  to  that  oi  A;Ei  and  E2  are  unit 
matrices).    Then 


TAT-'^=^ 


El    0\IQt^     0 
U     E, 


fQ,     OU    E,    0\J  Q,  0\ 

\s    qJ\-u  eJ    [uq^-q^u+s  qJ'   ' 


The  equation  (91')  reduces  to  (91)  if  we  choose  the  rectangular  matrix 
U  so  that  it  satisfies  the  matrix  equation 

If  Qi  and  Q2  have  no  characteristic  values  in  common,  then  this  equation 
always  has  a  unique  solution  for  every  right-hand  side  8  (see  Vol.  I,  Chapter 
VIII,  §3). 

2.  In  the  case  where  Qi  and  Q2  have  characteristic  values  in  common, 
we  replace  Qi  in  (90)  by  its  Jordan  form  /  (as  a  result,  A  is  replaced  by  a 
similar  matrix).  Let  J={JiJ2},  where  all  the  Jordan  blocks  with  the 
characteristic  value  Xq  are  combined  in  Ji.    Then 


A  = 


0     0     0^ 


Q, 


This  matrix  falls  under  the  preceding  case,  since  the  matrices  Ji  and  Q2  have 
no  characteristic  values  in  common.  Hence  it  follows  that  the  elementary 
divisors  of  the  form  {X  —  ko)^  are  the  same  for  A  and  Ji  and  therefore  also 
for  A  and  Q\.    This  proves  the  lemma. 

If  an  irreducible  stochastic  matrix  P  has  a  complex  characteristic  value 
Xo  with  I  Ao  I  =  1,  then  X^P  is  similar  to  P  (see  (16) )  and  so  it  follows  from 
Theorem  10  that  to  Xo  there  correspond  only  elementary  divisors  of  the  first 
degree.  With  the  help  of  the  normal  form  and  of  Lemma  4  it  is  easy  to 
extend  this  statement  to  reducible  stochastic  matrices.    Thus  we  obtain : 

Corollary  1.  If  Xo  is  a  characteristic  value  of  a  stochastic  matrix  P  and 
I  ^0  I  =  1,  then  the  elementary  divisors  corresponding  to  Xq  are  of  the  first 
degree. 

From  Theorem  10  we  also  deduce  by  2.  (p.  84)  : 

Corollary  2.  If  a  positive  characteristic  vector  belongs  to  the  maximal 
characteristic  value  r  of  a  non-negative  matrix  A,  then  all  the  elementary 
divisors  of  A  that  belong  to  a  characteristic  value  Xo  with  \  Xo\=r  are  of  the 
first  degree. 
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We  shall  now  mention  some  papers  that  deal  with  the  distribution  of  the 
characteristic  values  of  stochastic  matrices. 

A  characteristic  value  of  a  stochastic  matrix  P  always  lies  in  the  disc 
M  I  =  1  of  the  2-plane.  The  set  of  all  points  of  this  disc  that  are  character- 
istic values  of  any  stochastic  matrices  of  order  n  will  be  denoted  by  Mn. 
3.  In  1938,  in  connection  with  investigation  on  Markov  chains  A.  N.  Kol- 
mogorov  raised  the  problem  of  determining  the  structure  of  the  domain  M„. 
This  problem  was  partially  solved  in  1945  hj  N.  A.  Dmitriev  and  E.  B. 
Dynkin  [133],  [133a]  and  completely  in  1951  in  a  paper  by  F.  I.  Karpelevich 
[209].  It  turned  out  that  the  boundary  of  Mn  consists  of  a  finite  number 
of  points  on  the  circle  |  >^  |  =  1  and  certain  curvilinear  arcs  joining  these 
points  in  cyclic  order. 

We  note  that  by  Proposition  2.  (p.  84)  the  characteristic  values  of  the 
matrices  A=  11  (liA;  ll"  ^0  having  a  positive  characteristic  vector  for  ^  =  r 
with  a  fixed  r  form  the  set  vMn.^^  Since  every  matrix  A=  ||  a^fc  ||i  ^  0 
can  be  regarded  as  the  limit  of  a  sequence  of  non-negative  matrices  of  that 
type  and  the  set  r  •  Mn  is  closed,  the  characteristic  values  of  arbitrary  matrices 
^=  II  fl^ifc  111  ^0  with  a  given  maximal  characteristic  value  r  fill  out  the 
setr-i!f^.*o 

A  paper  by  H.  R.  Suleimanova  [359]  is  relevant  in  this  context;  it  con- 
tains sufficiency  criteria  for  n  given  real  numbers  X\,  X2,  .  .  . ,  An  to  be  the 
characteristic  values  of  a  stochastic  matrix  P=  ||  pij  ||i  .*^ 


§  7.    Limiting  Probabilities  for  a  Homogeneous  Markov  Chain 
with  a  Finite  Number  of  States 

1.   Let 

be  all  the  possible  states  of  a  system  in  a  homogeneous  Markov  chain  and  let 
^—  II  Pij  II I  be  the  stochastic  matrix  determined  by  this  chain  that  is  formed 
from  the  transition  probabilities  pij  (i,  j  =  1,2,  ...  ,  n)  (see  p.  82) . 

We  denote  by  p^p  the  probability  of  finding  the  system  in  the  state  Sj  at 
the  instant  t^c  if  it  is  known  that  at  the  instant  i^c-q  it  is  in  the  state  8i 
(i,  j  =  l,  2,  ...,  n;  q  =  l,  2,  ..  .).     Clearly,  p^i^=Pij  {i,  j  =  l,  2,  .  ..,  n). 


39  r  •  Mn  is  the  set  of  points  in  the  A.-plane  of  the  form  r/z,  where  fi  c  Mn. 

40  Kolmogorov  has  shown   (see  [133a  (1946)],  Appendix)   that  this  problem  for  an 
arbitrary  matrix  A"^  0  can  be  reduced  to  the  analogous  problem  for  a  stochastic  matrix. 

'ii  See  also  [312]. 
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Making  use  of  the  theorems  on  the  addition  and  multiplication  of  probabili- 
ties, we  find  easily : 


or,  in  matrix  notation 


pT''=^p'^Pm      (i,;=l,2,....») 

h=l 


Hence,  by  giving  to  q  in  succession  the  values  1,  2,  .  .  . ,  we  obtain  the  impor- 
tant f  ormula^^ 

\\pf\\=P'      (2  =  1,2....). 
If  the  limits 

^^pf  =  Pl^      (^?=1,  2,  ...,  w) 

q—^oo 

or,  in  matrix  notation, 

^'-♦■oo 

exist,  then  the  values  p^  (i,  j  =  l,  2,  .  .  . ,  n)  are  called  the  limiting  or  final 
transition  pr  oh  abilities  .^^ 

In  order  to  investigate  under  what  conditions  limiting  transition  proba- 
bilities exist  and  to  derive  the  corresponding  formulas,  we  introduce  the  fol- 
lowing terminology. 

We  shall  call  a  stochastic  matrix  P  and  the  corresponding  homogeneous 
Markov  chain  regular  if  P  has  no  characteristic  values  of  modulus  1  other 
than  1  itself  and  fully  regular  if,  in  addition,  1  is  a  simple  root  of  the 
characteristic  equation  of  P. 

A  regular  matrix  P  is  characterized  by  the  fact  that  in  its  normal  form 
(69)  (p.  75)  the  matrices  Ai,  A2, .  .  . ,  Ag  are  primitive.  For  a  fully  regular 
matrix  we  have,  in  addition,  ^  =  1. 

Furthermore,  a  homogeneous  Markov  chain  is  irreducible,  reducible, 
acyclic  or  cyclic  if  the  stochastic  matrix  P  of  the  chain  is  irreducible,  reduc- 
ible, primitive,  or  imprimitive,  respectively.  Just  as  a  primitive  stochastic 
matrix  is  a  special  form  of  a  regular  matrix,  so  an  acyclic  Markov  chain  is  a 
special  form  of  a  regular  chain. 

We  shall  prove  that :  Limiting  transition  prohahilities  exist  for  regular 
homogeneous  Markov  chains  only. 


42  It  follows  from  this  formula  that  the  probabilities  p^j  as  well  as  pu   (i,  j  =  l,  2, 
3,  . . . ,  n;  q=^l,  2,  .  .  .)  do  not  depend  on  the  index  Tc  of  the  original  instant  tk. 

43  The  matrix  P=°,  as  the  limit  of  stochastic  matrices,  is  itself  stochastic. 
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For  let  t/;(A)  be  the  minimal  polynomial  of  the  regular  matrix  P=  11  p^y  11 1  . 
Then 

,^,i^X)  =  {X-X^r^{X-XX-"'{X-X:)^     iky^X,',  i,k=l,2,...,u),  (92) 
By  Theorem  10  we  may  assume  that 

Ai  =  l,     mi  =  l.  (93) 
By  the  formula  (23)  of  Chapter  V  (Vol.  I,  p.  107), 


.^(1)  .     ft        1 


V(l)        *=2 


(94) 


where  C (X)  =  {XE  —  P)~'^y)(X)  is  the  reduced  adjoint  matrix  and 

^(^'  =  ^X^*       (^  =  1.2,. ...«); 
moreover 

If  P  is  a  regular  matrix,  then 

|A,|<1       {k  =  2,3,...,u), 

and  therefore  all  the  terms  on  the  right-hand  side  of  (94),  except  +he  first, 
tend  to  zero  for  q^>  oo.  Therefore,  for  a  regular  matrix*  P  the  matrix  P* 
formed  from  the  limiting  transition  probabilities  exists,  and 

P'^^^.  (95) 

V  (1) 

The  converse  proposition  is  obvious.    If  the  limit 

P"=  Um  P«  (96) 

q-*oo 

exists,  then  the  matrix  P  cannot  have  any  characteristic  value  X^  for  which 
Xjc^l  and  I  i^fc  I  =  1,  since  then  the  limit  lim  XI    would  not  exist.     (This 

limit  must  exist,  since  the  limit  (96)  exists.) 

We  have  proved  that  the  matrix  P*  exists  for  a  regular  homogeneous 
Markov  chain  (and  for  such  a  regular  chain  only).  This  matrix  is  deter- 
mined by  (95). 
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We  shall  now  show  that  F*  can  be  expressed  by  the  characteristic  poly- 
nomial 

A  (A)  =  {?i-  X^r^  (A  -  A2)««  ...  (A  -  A J««  (97) 

and  the  adjoint  matrix  B(X)  =  (A^  — P)-M(A). 
From  the  identity 

A  (A)       y,  (A) 
it  follows  by  (92),  (93),  and  (97)  that 

7li^(ni-l)  (i)_C(i) 
Aini)  (i)      ""^'(1)* 

Therefore  (95)  may  be  replaced  by  the  formula 

_  n,Bin,-i)  (1) 
■^    -      AM{1)     '  ^^^^ 

For  a  fully  regular  Markov  chain,  inasmuch  as  it  is  a  special  form  of  a 
regular  chain,  the  matrix  P*  exists  and  is  determined  by  (95)  or  (^98).  In 
this  case  ni  =  1,  and  (98)  assumes  the  form 


Doo  — — 


BJXi 
A'{1) 


(99) 


2.    Let  us  consider  a  regular  chain  of  general  type  (not  fully  regular) .    We 
write  the  corresponding  matrix  P  in  the  normal  form 


iQi 


0      0 


p= 


0 


g+1,1 


Qg       0 


(100) 


\u. 


V,^    .    .    .    ,    C7,.,_i  Qj 
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where  Qi,  . .  • ,  Qg  are  primitive  stochastic  matrices  and  the  maximal  values 
of  the  irreducible  matrices  Qg+i,  . .  . ,  Qs  are  less  than  1.    Setting 

^=\ '      ^=      


we  write  P  in  the  form 


P  = 


Then 


and 


'Q^    .    ,    ,    0  0 


0    .    ,    ,    Q,         0 
U  Wi 


'01  .   .   ,0      0 

p^=|    ;     *  .    ;      •    I  (101) 

0  .  .  .g^    0 


0    ...Q; 


But  TT*  =lim  W^  =  0,  because  all  the  characteristic  values  of  W  are  of 
modulus  less  than  1.    Therefore 


(102) 


Since  Qi,  ■  •  .  ,  Qg  are  primitive  stochastic  matrices,  the  matrices  Q^ ,  . .  . , 
Q*  are  positive,  by  (99)  and  (35)  (p.  62) 
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Q->0,  ...,Q->0, 

and  in  each  of  these  matrices  all  the  elements  belonging  to  any  one  column 
are  equal : 

<^r=\\mh=i     (A  =  1.2.  ...,?)• 

We  note  that  the  states  Si,  82,  ■  ■  • ,  Sn  of  the  system  fall  into  groups  cor- 
responding to  the  normal  form  (100)  of  P: 

Z^,  Za,  . . .,  2",,  Z^^^,  . . .,  r,  (103) 

To  each  group  2  in  (103)  there  corresponds  a  group  of  rows  in  (100). 
In  the  terminology  of  Kolmogorov  the  states  of  the  system  that  occur  in 
Ui,  2*2,  .  .  .  ,  Zg  are  called  essential  and  the  states  that  occur  in  the  remaining 
groups  2*^+1,  .  .  .  ,  Zs  non-essential. 

From  the  form  (101)  of  P^  it  follows  that  in  any  finite  number  q  of  steps 
(from  the  instant  tjc-g  to  tjc)  only  the  following  transitions  of  the  system  are 
possible :  a )  from  an  essential  state  to  an  essential  state  of  the  same  group ; 
b)  from  a  non-essential  state  to  an  essential  state ;  and  c)  from  a  non-essential 
state  to  a  non-essential  state  of  the  same  or  a  preceding  group. 

From  the  form  (102)  of  P*  it  follows  that:  A  limiting  transition  can 
only  lead  from  an  arbitrary  state  to  an  essential  state,  i.e.,  the  probability 
of  transition  to  any  non-essential  state  tends  to  zero  when  the  number  of 
steps  q  tends  to  infinity.  The  essential  states  are  therefore  sometimes  also 
called  limiting  states. 

3.    From  (95)  it  follows  that*^ 

Hence  it  is  clear  that :  Every  column  of  P"^  is  a  characteristic  vector  of  the 
stochastic  matrix  P  for  the  characteristic  value  ^  =  1. 

For  a  fully  regular  matrix  P,  1  is  a  simple  root  of  the  characteristic  equa- 
tion and  (apart  from  scalar  factors)  only  one  characteristic  vector  (1,  1,  .  .  .  ,  1) 
of  P  belongs  to  it.  Therefore  all  the  elements  of  the  ,/-th  column  of  P  °° 
are  equal  to  one  and  the  same  non-negative  number  p^. : 

n 

p^=p-^0      ij=.l,2,...,n;    ^p*j=l).  (104) 

7=1 


41  See  [212]  and  [46],  pp.  37-39. 

4^  This  formula  holds  for  an  arbitrary  regular  chain  and  can  be  obtained  from  the 
obvious  equation  PQ  —  P*  Pq—^  =^  0  by  passing  to  the  limit  q^  co. 
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Thus,  in  a  fully  regular  chain  the  limiting  transition  probabilities  do 
not  depend  on  the  initial  state. 

Conversely,  if  in  a  regular  homogeneous  Markov  chain  the  limiting 
transition  probabilities  do  not  depend  on  the  initial  state,  i.e.,  if  (104)  holds, 
then  obviously  in  the  scheme  (102)  for  P"  we  have  g  =  l.  But  then  ni  =  l 
and  the  chain  is  fully  regular. 

For  an  acyclic  chain,  which  is  a  special  case  of  a  fully  regular  chain, 
P  is  a  primitive  matrix.  Therefore  F'i  ^  0  (see  Theorem  8  on  p.  80)  for 
some^>0.    But  then  also  P*  ^P^P^  >  0.^« 

Conversely,  it  follows  from  P"^  ^  0  that  P^  ^  0  for  some  g  >  0,  and 
this  means  by  Theorem  8  that  P  is  primitive  and  hence  that  the  given  homo- 
geneous Markov  chain  is  acyclic. 

We  formulate  these  results  in  the  following  theorem : 

Theorem  11 :  1.  In  a  homogeneous  Markov  chain  all  the  limiting  tramsi- 
tion  prohaMlities  exist  if  and  only  if  the  chain  is  regular.  In  that  case  the 
matrix  P*  formed  from  the  limiting  transition  probabilities  is  determined 
by  (95)  or  (98). 

2.  In  a  regular  homogeneous  Markov  chain  the  limiting  transition  proba- 
bilities are  independent  of  the  initial  state  if  and  only  if  the  chain  is  fully 
regular.    In  that  case  the  matrix  P*  is  determined  by  (99). 

3.  In  a  regular  homogeneous  Markov  chain  all  the  limiting  transition 
probabilities  are  different  from  zero  if  and  only  if  the  chain  is  acyclic.'^'' 

4.    We  now  consider  the  columns  of  absolute  probabilities 

P  =  (kk--;k)  {k  =  0,l,2,...),  (105) 

k 

where  pi  is  the  probability  of  finding  the  system  in  the  state  Si  (i  =  l,2,  .  .  .  , 
n;  k  =  0,  1,  2,  .  .  .)  sX  the  instant  t^.  Making  use  of  the  theorems  on  the 
addition  and  multiplication  of  probabilities,  we  find : 


0 

P 


i=2JPkPM  {i  =  l,2,...,n',k  =  l,2,.. .) 


h=l 

or,  in  matrix  notation. 


'^6  This  matrix  equation  is  obtained  by  passing  to  the  limit  m  -^  oo  from  the  equation 
pm=  pm-q  .  pq  (m  >  q) .  P*  is  a  stochastic  matrix;  therefore  P'^  ^0  and  there  are 
non-zero  elements  in  every  row  of  P*.  Hence  P'^Pq^  0.  Instead  of  Theorem  8  we  can 
use  here  the  formula  (99)  and  the  inequality  (35)   (p.  62). 

^7  Note  that  P'^  "^  0  implies  that  the  chain  is  acyclic  and  therefore  regular.  Hence  it 
follows  automatically  from  P'^  ^  0  that  the  limiting  transition  probabilities  do  not 
depend  on  the  initial  state,  i.e.,  that  the  formulas  (104)  hold. 
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p  =  {p-^fp      (yfc  =  l,2,  ...),  (106) 

where  P"""  is  the  transpose  of  P. 

All  the  absolute  probabilities  (105)  can  be  determined  from  (106)  if  the 

initial  probabilities  pi,  P2,  -  -  • ,  Pn  and  the  matrix  of  transition  probabilities 
■P=  II  Vi)  II 1   are  known. 

We  introduce  the  limiting  absolute  probabilities 

oo  k 

Pi  =  '^mpi         (^  =  1,2,  ...,  w) 

k-*oo 

or 

oo  oo         oo  oo  ]c 

P={VvP2>  ...,  ^jj^lim^}. 

;i;->oo 

When  we  take  the  limit  /c  ->  oo  on  both  sides  of  (106),  we  obtain : 

p=iP-rp.  (107) 

Note  that  the  existence  of  the  matrix  of  limiting  transition  probabilities 
P*  implies  the  existence  of  the  limiting  absolute  probabilities 


P  =  {Pl>  P2,"',  Pn) 

for  arbitrary  initial  probabilities  p  =  [pi,  p2,  .  .  . ,  Pn) ,  and  vice  versa. 

From  the  formula  (107)  and  the  form  (102)  of  P*  it  follows  that :   The 
limiting  absolute  probabilities  corresponding  to  non-essential  states  are  zero. 

Multiplying  both  sides  of  the  matrix  equation 

by  p  on  the  right,  we  obtain  by  (107)  : 

P^p^p,  (108) 

i.e. :  The  column  of  limiting  absolute  probabilities  p  is  a  characteristic  vector 
of  P"""  for  the  characteristic  value  X  =  1. 

If  a  fully  regular  Markov  chain  is  given,  then  A  =  1  is  a  simple  root  of 
the  characteristic  equation  of  P""".    In  this  case,  the  column  of  limiting  abso- 

oo 

lute  probabilities  is  uniquely  determined  by  (108)   (because  p^  ^0  (j  = 
1,  2,  ...,n)  andf^^.  =1). 
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Suppose  that  a  fully  regular  Markov  chain  is  given.  Then  it  follows 
from  (104)  and  (107)  that: 

n  n 

P  —  2J PhPhi=P7? 2J Ph  =  v7i        0*=1»  2,  ...,n).  (109) 

h=l  h=l 

.     .     .  °°  °**  oo 

In  this  case  the  limiting  absolute  probabilities  p^,  p^,  . . .,  p^  do  not 
depend  on  the  initial  probabilities  p^,  Pz,  •  ■  •  ,Pn- 

Conversely,  p  is  independent  of  p  on  account  of  (107)  if  and  only  if  all 
the  rows  of  P*  are  equal,  i.e., 

pTj=pZ      {h,  j=l,  2,  ...,n) 

so  that  (by  Theorem  11)  P  is  a  fully  regular  matrix. 
If  P  is  primitive,  then  F'^  ^  0  and  hence,  by  (109), 

^•>0         {j=l,2,...,n). 

Conversely,  if  all  the  Pj  (i  =  1,  2,  .  .  .  ,  n)  are  positive  and  do  not  depend 
on  the  initial  probabilities,  then  all  the  elements  in  every  column  of  P*  are 
equal  and  by  (109)  P°°  ^  0,  and  this  means  by  Theorem  11  that  P  is  primi- 
tive, i.e.,  that  the  given  chain  is  acyclic. 

From  these  remarks  it  follows  that  Theorem  11  can  also  be  formulated 
as  follows : 

Theorem  11' :  1.  In  a  homogeneous  Markov  chain  all  the  limiting  abso- 
lute probabilities  exist  for  arbitrary  initial  probabilities  if  and  only  if  the 
chain  is  regular. 

2.  In  a  homogeneous  Markov  chain  the  limiting  absolute  probabilities 
exist  for  arbitrary  initial  probabilities  and  are  independent  of  them  if  and 
only  if  the  chain  is  fully  regular. 

3.  In  a  homogeneous  Markov  chain  positive  limiting  absolute  probabili- 
ties exist  for  arbitrary  initial  probabilities  and  are  independent  of  them  if 
and  only  if  the  chain  is  acyclic.'^^ 

5.    We  now  consider  a  homogeneous  Markov  chain  of  general  type  with  a 
matrix  P  of  transition  probabilities. 


4^  The  second  part  of  Theorem  11'  is  sometimes  called  the  ergodic  theorem  and  the 
first  part  the  general  quasi-ergodic  theorem  for  homogeneous  Markov  chains  (see  [4], 
pp.  473  and  476). 


96  XIII.    Matrices  with  Non-Negative  Elements 

We  choose  the  normal  form  (69)  for  P  and  denote  by  hi,  h2,  .  .  . ,  hg  the 
indices  of  imprimitivity  of  the  matrices  Ai,  A2,  .  .  . ,  Ag  m  (69).  Let  h  be 
the  least  common  multiple  of  the  integers  hi,  h2,  . .  . ,  hg.  Then  the  matrix 
P^  has  no  characteristic  values,  other  than  1,  of  modulus  1,  i.e.,  P^  is  regular ; 
here  h  is  the  least  exponent  for  which  P^  is  regular.  We  shall  call  h  the 
period  of  the  given  homogeneous  Markov  chain. 

Since  P^  is  regular,  the  limit 

lim  pM  =  i^ph)oo 

q-¥oo 

exists  and  hence  the  limits 

P,=  lim  P'-+<?A=pr(pA)oo  (^^0,  1,  ...,^  —  1) 

also  exist. 

Thus,  in  general,  the  sequence  of  matrices 

p,  p^  p^ . . . 

splits  into  h  subsequences  with  the  limits  Pr^=P^(P^)'^  {r=0,l,  .  .  .  ,h  —  1). 
When  we  go  from  the  transition  probabilities  to  the  absolute  probabili- 
ties by  means  of  (106),  we  find  that  the  sequence 

12     3 

p,p,p,... 
splits  into  h  subsequences  with  the  limits 

]hn"p'=  (P^^)->p     (r=  0,  1,  2,  . . . ,  ^  -  1). 

For  an  arbitrary  homogeneous  Markov  chain  with  a  finite  number  of 
states  the  limits  of  the  arithmetic  means  always  exist : 

P  =  lim  -^  ^P*  =  4-(^  +  -^  +  *"  +  -P*~^H^*)~  (11^) 


J^^co    N   ,±i  h 


and 


^=:lim  ^  i;p  =  P''p,  (110') 

Here  P  =  ||  %  \\\  and  p  =  (pi,  p2,  •  •  ■  ,  Pn) •  The  values  pij  (i,  j  =  1,  2, 
S,  .  . . ,  n)  and  Pj  (j  =  l,  2,  .  .  .  ,  n)  are  called  the  mean  limiting  fransiiion 
prohahilities  and  mean  limiting  absolute  probabilities,  respectively. 


i 
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Since 

we  have 

PP=P 

and  therefore,  by  (110'), 

P^P  =  P;  (111) 

i.e.,  p  is  a  characteristic  vector  of  P"""  for  X  =  l. 

Note  that  by  (69)  and  (110)  we  may  represent  P  in  the  form 


where 


1       ^  iV 

■ii  =  lim^2;4(*'  =  l>2,  ...,9r)     fr^lim-^  j;if*, 


Since  all  the  characteristic  values  of  W  are  of  modulus  less  than  1,  we 
have 

and  therefore  W  =  0. 
Hence 

'  Ji  0  .   .   .  0 
0  A^.   .   .00 

J  (112) 

0  0  .   .   .A^ 

U  0^ 

Since  P  is  a  stochastic  matrix,  the  matrices  Ai,  A2,  .  .  . ,  Ag  are  also 
stochastic. 
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From  this  representation  of  P  and  from  (107)  is  follows  that :  The  mean 
limiting  absolute  probabilities  corresponding  to  non-essential  states  are 
always  zero. 

If  gr  =  1  in  the  normal  form  of  P,  then  2  =  1  is  a  simple  characteristic 
value  of  P""". 

In  this  case  p  is  uniquely  determined  by  (111),  and  the  mean  limiting 
probabilities  pi,  p2,   •  •  • ,  Pn  do  not  depend   on  the  initial  probabilities 

Pi,  ^2j  •  •  •  >  Pn-    Conversely,  if  p  does  not  depend  on  p,  then  P  is  of  rank  1 
by  (110').    But  the  rank  of  (112)  can  be  1  only  if  ^  =  1. 
We  formulate  these  results  in  the  following  theorem  :*^ 
Theorem  12 :    For  an  arbitrary  homogeneous  Markov  chadn  with  period 

h  the  probability  matrices  P^  and  p  tend  to  a  periodic  repetition  with  period 
h  for  yt  ->  00  ;  moreover,  the  mean  limiting  transition  probabilities  and  the 
absolute  probabilities  P  =  ||  pu  11 1  and  p  =  (pi,  p2,  •  •  . ,  Pn)  defined  by  (110) 
and  (110')  always  exist. 

The  mean  absolute  probabilities  corresponding  to  non-essential  states  are 
always  zero. 

If  g^lin  the  normal  form  of  P  {and  only  in  this  case),  the  mean  limit- 
ing absolute  probabilities  pi,  p2, .  . . ,  Pn  are  independent  of  the  initial  proba- 
bilities ^1,  ^2^  '  '  '  ,Pn  d'i^d  are  uniquely  determined  by  (111). 

§  8.    Totally  Non-negative  Matrices 

In  this  and  the  following  sections  we  consider  real  matrices  in  which  not 
only  the  elements,  but  also  all  the  minors  of  every  order  are  non-nesrative. 
Such  matrices  have  important  applications  in  the  theory  of  small  oscilla- 
tions of  elastic  systems.  The  reader  will  find  a  detailed  study  of  these 
matrices  and  their  applications  in  the  book  [17].  Here  we  shall  only  deal 
with  some  of  their  basic  properties. 

1.    We  begin  with  a  definition  : 

Definition  5  :    A  rectangular  matrix 

^  =  \\(^ik\\       (*  =  1»  2,  ...,m;  k  =  l,2,  ...,n) 

is  called  totally  non-negative  (totally  positive)  if  all  its  minors  of  any  order 
are  non-negative  (positive)  : 


^9  This  theorem  is  sometimes  called  the  asymptotic  theorem  for  homogeneous  Markov 
chains.    See  [4],  pp.  479-82. 
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"'I    fc^  •  '  '  f^j 


h  <h  <"'<^p 


A^i  <!  "^2  "^  *  '  *  "^     p 


(>0) 
1,  2,  . .  .,  min  {m,n) 


In  what  follows  we  shall  only  consider  square  totally  non-negative  and 
totally  positive  matrices. 

Example  1.    The  generalized  Vandermonde  matrix 


V=^a- 


7k 


{0  <ai  <a2<  ...  <an;  ai  <  a2<  ...<  an) 


is  totally  positive.  Let  us  show  first  that  |  T  |  =7^  0.  Indeed,  from  |  T  |  =  0 
it  would  follow  that  we  could  determine  real  numbers  Ci,  C2,  .  .  .  ,  Cn,  not  all 
equal  to  zero,  such  that  the  function 


{aiy^aj  for  i^j) 


has  the  n  zeros  Xi^ a^  {i^l,  2,  .  .  .  ,  n) ,  where  n  is  the  number  of  terms  in 
the  above  summand.  For  n=^l  this  is  impossible.  Let  us  make  the  induc- 
tion hypothesis  that  it  is  impossible  for  a  sum  of  n^  terms,  where  tii  <  n, 
and  show  that  it  is  then  also  impossible  for  the  given  function  f{x).  Assume 
the  contrary.  Then  by  Rolle's  Theorem  the  function  f\{x\  =  [x~"^f{x)y 
consisting  of  n  —  1  terms  would  have  n  —  1  positive  zeros,  and  this  contra- 
dicts the  induction  hypothesis. 

Thus,  I  y  I  =7^  0.  But  for  ai  =  0,  02  =  1,  .  .  . ,  «„  =  n  —  1  the  determinant 
I  Y  I  goes  over  into  the  ordinary  Vandermonde  determinant  |  a^*~^|i,  which 
is  positive.  Since  the  transition  from  this  to  the  generalized  Vandermonde 
determinant  can  be  carried  out  by  means  of  a  continuous  change  of  the 
exponents  a-i,  02,  .  .  . ,  an  with  preservation  of  the  inequalities  ai  <  02  <  .  .  . 
<  an,  and  since,  by  what  we  have  shown,  the  determinant  does  not  vanish 
in  this  process,  we  have  |  "F  |  >  0  for  arbitrary  0  <  ai  <  02  <  .  .  .  <  a„. 

Since  every  minor  of  V  can  be  regarded  as  the  determinant  of  some  gen- 
eralized Vandermonde  matrix,  all  the  minors  of  V  are  positive. 

Example  2.    We  consider  a  JacoM  matrix 


J= 


Cj  ttg  62  • 
0    Co  a,. 


0   0    0 


0  0 
0  0 
0     0 


(113) 
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in  which  all  the  elements  are  zero  outside  the  main  diagonal  and  the  first 
super-diagonal  and  sub-diagonal.  Let  us  set  up  a  formula  that  expresses  an 
arbitrary  minor  of  the  matrix  in  terms  of  principal  minors  and  the  elements 
h,  c.    Suppose  that 

"'I  <C  "^2  "^  *   '  *  "^C  "'p 

and 

then 

Ui h -"^p/     Ui . . . kJ  \ ^vj+i/      \kJ  \ ^v,+i . . •  kJ 

This  formula  is  a  consequence  of  the  easily  verifiable  equation : 

■'{;::::;')='C:::t:)-'a')-'a:::::3  """-"■'■  '■■« 

From  (114)  it  follows  that  every  minor  is  the  product  of  certain  prin- 
cipal minors  and  certain  elements  of  J.  Thus:  For  J  to  he  totally  non- 
negative  it  is  necessary  and  sufficient  that  all  the  principal  minors  and  the 
elements  1),  c  should  he  non-negative. 

2.  A  totally  non-negative  matrix  A=  11  a^fc  11 "  always  satisfies  the  follow- 
ing important  determinantal  inequality  :^° 

/I     2  ..,n\         (l     2  ...  p\      fp  +  l  ...n\ 

Before  deriving  this  inequality,  we  prove  the  following  lemma : 

Lemma  5:  If  in  a  totally  non-negative  matrix  A=  ||  aa  11 1  any  prin- 
cipal minor  vanishes,  then  every  principal  minor  ^  h ordering '  it  also  vanishes. 

Proof.  The  lemma  will  be  proved  if  we  can  show  that  for  a  totally  non- 
negative  matrix  A  =  11  ai^  11?  it  follows  from 


^°  See   [172]   and   [17],  pp.  lllff,   where  it  is  also  shown  that  the  equality  sign  in 
(116)  can  only  hold  in  the  following  obvious  cases: 

1)  One  of  the  factors  on  the  right-hand  side  of  (116)  is  zero; 

2)  All  the  elements  aik  (1=1,2,  ...,  p;  h  =  p  -{-  1,  .  .  . ,  n)  or  aik  (i  =  p  -\-  1,  .  .  .  ,  n; 
Tc=l,2, . . .  ,p)  are  zero. 

The  inequality  (116)  has  the  same  outward  form  as  the  generalized  Hadamard  inequal- 
ity (see  (33),  Vol.  I,  p.  255)  for  a  positive-definite  hermitian  or  quadratic  form. 
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that 


c 


2  ...q 
2...q 

< 


2  ... 
2  ... 


0      {q<n) 


=  0^ 


(117) 
(118) 


For  this  purpose  we  consider  two  cases 


1)  ^11  =  0.    Since 


—  anaijc  ^  0,  an  ^  0,  a^c  ^  0  (i,  k  =  2, 


n)j  either  all  the  aii  =  0  ii^2,  . . . ,  n)  or  all  the  aik  =  0  {k=^2, 
These  equations  and  an  =  0  imply  (118). 
2)  an  =7^  0.    Then  for  some  p  (I'^p'^q) 


n) 


A 


2  ...  2?  — 1     p 


2  ...  p  —  lj  \l     2 

We  introduce  bordered  determinants 


2?  — 1     p 


=  0. 


/I     2...P  —  1     i\ 
^»*^^    1      o  17         {i.k  =  p,p^-l,...,n) 

\1     2  . . .  p  —  \     kj 

and  form  from  them  a  matrix  D  =  11  dik  lip. 

By  Sylvester's  identity  (Vol.  I,  Chapter  II,  §  3), 

\«^l       "^2    •  •  •    '^gJ 

1     2  ...  2?  — l\>-i     /I     2 
.1     2...??-ljJ      ^\1     2 

/  h    <  *2    <  •  •  •  <  in  \ 

H;,<;<...</^^^  .=1,2,. ..,.-,+1), 


(119) 


(120) 


p  —  \     \     k^...  kJ 


so  that  D  is  a  totally  non-negative  matrix. 
Since  by  (119) 


'-=K  2:::')=" 


the  matrix  B  falls  under  the  case  1)  and 


j^lv    P  +  1  ...  M 
\p    p  +  l  ..,  n) 


1     2  ...  p  —  l 
1     2...^  — 1 


"-P     /I     2  ... 
A 

1     2 


nj 
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Since  A  L  ^  ' ' '     _i )  =7^  0,  (118)  follows,  and  the  lemma  is  proved. 

3.  We  may  now  assume  in  the  derivation  of  the  inequality  (116)  that  all 
the  principal  minors  of  A  are  different  from  zero,  since  by  Lemma  5  one  of 
the  principal  minors  can  only  be  zero  when  |  A  |  =  0,  and  in  this  case  the 
inequality  (116)  is  obvious. 

For  n  =  2,  (116)  can  be  verified  immediately: 


1     2 
1     2 


—  ff  11^22         ^2^21  =  ^11^22 


since  0^12  ^  0,  a2i  ^  0.  We  shall  establish  (116)  for  ti  >  2  under  the  assump- 
tion that  it  is  true  for  matrices  of  order  less  than  n.  Moreover,  without  loss 
of  generality,  we  may  assume  that  p  >  1,  since  otherwise  by  reversing  the 
numbering  of  the  rows  and  columns  we  could  interchange  the  roles  of  p 
and  n  —  p. 

We  now  consider  again  the  matrix  D  =  11  di]c  K,  where  the  da  (i,  k  =  p, 
p  +  1,  . .  . ,  n)  are  defined  by  (120)  ;  we  use  Sylvester's  identity  twice  as  well 
as  the  basic  inequality  (116)  for  matrices  of  order  less  than  n  and  obtain: 


1     2  . 

1     2  . 


j^/V    P  +  1...M 


dr,r.D 


P+l 


(122) 


Thus,  the  inequality  (116)  has  been  established. 
Let  us  make  the  following  definition : 
Definition  6.     A  minor 


A 


K-1        K 


2    • 


<^. 


fC-^  <Z.  iC^  <C  •  •  •  <C  l^r 


<n 


(123) 


of  the  matrix  A  =  11  anc  p  will  he  called  almost  principal  if  of  the  differences 
ii  —  ki,  12  —  k2, .  .  .  ,ip  —  kp  only  one  is  not  zero. 
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We  can  then  point  out  that  the  whole  derivation  of  (116)  (and  the  proof 
of  the  auxiliary  lemma)  remain  valid  if  the  condition  'A  is  totally  non- 
negative'  is  replaced  by  the  weaker  condition  'all  the  principal  and  almost 
principal  minors  of  A  are  non-negative.'^^ 


§  9.    Oscillatory  Matrices 

1.  The  characteristic  values  and  characteristic  vectors  of  totally  positive 
matrices  have  a  number  of  remarkable  properties.  However,  the  class  of 
totally  positive  matrices  is  not  wide  enough  from  the  point  of  view  of  appli- 
cations to  small  oscillations  of  elastic  systems.  In  this  respect,  the  class  of 
totally  non-negative  matrices  is  suffiently  extensive.  But  the  spectral 
properties  we  need  do  not  hold  for  all  totally  non-negative  matrices.  Now 
there  exists  an  intermediate  class  (between  that  of  totally  positive  and  that 
of  totally  non-negative  matrices)  in  which  the  spectral  properties  of  totally 
positive  matrices  are  preserved  and  which  is  of  sufficiently  wide  scope  for 
the  applications.  The  matrices  of  this  intermediate  class  have  been  called 
'  oscillatory. '  The  name  is  due  to  the  fact  that  oscillatory  matrices  form  the 
mathematical  apparatus  for  the  study  of  oscillatory  properties  of  small  vibra- 
tions of  elastic  systems. ^^ 

Definition  7.  A  matrix  A=\\  aikK  is  called  oscillatory  if  A  is  totally 
non-negative  and  if  there  exists  an  integer  g  >  0  such  that  A^  is  totally 
positive. 

Example.  A  Jacobi  matrix  J  (see  (113))  is  oscillatory  if  and  only  if 
1.  all  the  numbers  h,  c  are  positive  and  2.  the  successive  principal  minors  are 
positive : 


•''1  See  [214].  We  take  this  opportunity  of  mentioning  that  in  the  second  edition  of  the 
book  [17]  by  F.  E.  Gantmacher  and  M.  G.  Krein  a  mistake  crept  in  which  was  first 
pointed  out  to  the  authors  by  D.  M.  Kotelyanskii.  On  p.  Ill  of  that  book  an  almost 
principarl  minor  (123)  was  defined  by  the  equation 

With  this  definition,  the  inequality  (116)  does  not  follow  from  the  fact  that  the  principal 
and  the  almost  principal  minors  are  non-negative.    However,  all  the  statements  and  proofs 
of  §  6,  Chapter  II  in  [17]  that  refer  to  the  fundamental  inequality  remain  valid  if  an 
almost  principal  minor  is  defined  as  above  and  as  we  have  done  in  the  paper  [214]. 
52  See  [17],  Introduction,  Chapter  III,  and  Chapter  IV. 
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ai>0, 


>0, 


>0, 


61    0 

02    ^2 
Co    a. 


0  0 
0  0 
0     0 


0      0     0 


ttr 


>0.      (124) 


Necessity  of  1.,  2,  The  numbers  h,  c  are  non-negative,  because  J  ^  0. 
But  none  of  the  numbers  h,  c  may  be  zero,  since  otherwise  the  matrix  would 
be  reducible  and  then  the  inequality  J^  y  0  could  not  hold  for  any  g  >  0. 
Hence,  all  the  numbers  h,  c  are  positive.  All  the  principal  minors  of  (124) 
are  positive,  by  Lemma  5,  since  it  follows  from  |  J  |  ^  0  and  |  J^  |  >  0  that 
|J|>0. 

Sufficiency  of  1.,  2.  When  we  expand  |  J  |  we  easily  see  that  the  num- 
bers h,  c  occur  in  I  J"  I  only  as  products  hiCi,  &2C2,  ■  •  ■ ,  &n-ic„_i.  The  same 
applies  to  every  principal  minor  of  'zero  density,'  i.e.,  a  minor  formed  from 
successive  rows  and  columns  (without  gaps).  But  every  principal  minor  of 
J  is  a  product  of  principal  minors  of  zero  density.  Therefore  :  In  every  prin- 
cipal minor  of  J  the  numbers  h  and  c  occur  only  as  products  hiCi,  &2C2,  .  .  .  , 

hn-lCn-V 

We  now  form  the  symmetrical  Jacobi  matrix 


h 
a^ 
b. 


h 


hi=ihiCi>Q        (i==l,2,  ...,  n).         (125) 


From  the  above  properties  of  the  principal  minors  of  a  Jacobi  matrix  it 
follows  that  the  corresponding  principal  minors  of  J  and  J  are  equal.  But 
then  (124)  means  that  the  quadratic  form 

J{x,  x) 

is  positive  definite  (see  Vol.  I,  Chapter  X,  Theorem  3,  p.  306).  But  in  a 
positive-definite  quadratic  form  all  the  principal  minors  are  positive.  There- 
fore in  J  too  all  the  principal  minors  are  positive.  Since  by  1.  all  the  numbers 
h,  c  are  positive,  by  (114)  all  the  minors  of  /  are  non-negative;  i.e.,  /  is 
totally  non-negative. 

That  a  totally  non-negative  matrix  /  for  which  1.  and  2.  are  satisfied  is 
oscillatory  follows  immediately  from  the  following  criterion  for  an  oscilla- 
tory matrix. 
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A  totally  non-negative  matrix  A=  ||  aik  ||T  is  oscillatory  if  and  only  if: 

1)  A  is  non-singular  (|  A  |  >  0)  ; 

2)  All  the  elements  of  A  in  the  principal  diagonal  and  the  first  super- 
diagonals  and  sub-diagonals  are  different  from  zero  (c^ifc  >  0  for  \i  —  A:  |  ^  1). 

The  reader  can  find  a  proof  of  this  proposition  in  [17], Chapter  II,  §  7. 

2.  In  order  to  formulate  properties  of  the  characteristic  values  and  charac- 
teristic vectors  of  oscillatory  matrices,  we  introduce  some  preliminary  con- 
cepts and  notations. 

We  consider  a  vector  (column) 

Let  us  count  the  number  of  variations  of  sign  in  the  sequence  of  coordinates 
Ui,  U2,  .  .  . ,  Un  of  u,  attributing  arbitrary  signs  to  the  zero  coordinates  (if 
any  such  exist).  Depending  on  what  signs  we  give  to  the  zero  coordinates 
the  number  of  variations  of  sign  will  vary  within  certain  limits.  The 
maximal  and  minimal  number  of  variations  of  sign  so  obtained  will  be  de- 
noted by  S;t  and  Su,  respectively.  If  Su  =  Sji',  we  shall  speak  of  the  exact 
number  of  sign  changes  and  denote  it  by  Su.  Obviously  S;r  =  'S'^  if  and  only 
if  1.  the  extreme  coordinates  ui  and  Un  of  u  are  different  from  zero,  and 
2.  1^1  =  0  (1  <  t  <  n)  always  implies  that  Ui-iUi+i  <  0. 
We  shall  now  prove  the  following  fundamental  theorem : 

Theorem  13:    1.    An  oscillatory  matrix  ^=11  c^ifc  11 1  always  has  n  dis- 
tinct positive  characteristic  values 

^i>h>"'>K>0-  (126) 

2.  The  characteristic  vector  u=  (un,  U21,  .  .  . ,  Uni)  of  A  that  belongs 

to  the  largest  characteristic  value  Xi  has  only  non-zero  coordinates  of  like 

2 
sign;  the  characteristic  vector  u=^  (uio,  U22,     -  • ,  ^712)  ^^^^  belongs  to  the 

second  largest  characteristic  value  X2  has  exactly  one  variation  of  sign  in  its 

k 

coordinates ;  more  generally,  the  characteristic  vector  u  =  {uik,  uok,  -  • ,  Unu) 
that  belongs  to  the  characteristic  value  Xu  has  exactly  k  —  1  variations  of  sign 
(k  =  l,2,...,n). 

3.  For    arbitrary    real    numbers    Cg,    c^+i,    ...,    Ch    (1  -^g  ^h  -^n; 

Sc\  >  0)  the  number  of  variations  of  sign  in  the  coordinates  of  the  vector 


k 


t^=2JckU  (127) 


lies  between  g  —  1  and  h  —  1 
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g-l^S-^S^^h-l.  (128) 

Proof.  1.  We  number  the  characteristic  values  Xi,  X2,  .  .  .  ,  Xn  ot  A  so 
that 

and  consider  the  p-th  compound  matrix  STp  (p  =  l,2,  .  .  . ,  n)  (see  Chapter  I, 
§4).  The  characteristic  values  of  51^  are  all  the  possible  products  of  p 
characteristic  values  of  A  (see  Vol.  I,  p.  75),  i.e.,  the  products 

From  the  conditions  of  the  theorem  it  follows  that  for  some  integer  q  A'^ 
is  totally  positive.  But  then  Wp  ^0,  ?Ip  >  0  ;^^  i.e.,  STp  is  irreducible,  non- 
negative,  and  primitive.  Applying  Frobenius'  theorem  (see  §  2,  p.  40) 
to  the  primitive  matrix  51^  (p  =  1,  2,  .  .  .  ,  ti),  we  obtain 

AiA2-..Ap>0       {p  =  l,2,,..,n), 
^ih"-^p>^i^2"'^p-i^p+i       {p  =  l,2,  ...,n  —  l). 

Hence  (126)  follows. 

2.  From  this  inequality  (126)  it  follows  that  ^=11  ciik  11 1  is  a  matrix 
of  simple  structure.  Then  all  the  compound  matrices  ^p  (p  =  l,  2,  .  .  . ,  n) 
are  also  of  simple  structure  (see  Vol.  I,  p.  74). 

We  consider  the  fundamental  matrix  Z7=  ||  i^i^  ll?  of  A  (the  k-th  column 

k 

of  U  contains  the  coordinates  of  the  k-th  characteristic  vector  u  oi  A;  k=^ 
l,2,...,n).  Then  (see  Vol.  I,  Chapter  III,  p.  74),  the  characteristic  vector 
of  ^Tp  belonging  to  the  characteristic  value  X1X2  .  .  Ap  has  the  coordinates 

uh     '^'--'A       (l^^l<^2<...<^;^n)  (129) 

By  Frobenius'   theorem  all  the  numbers  (129)  are  different  from  zero 

12  n 

and  are  of  like  sign.  Multiplying  the  vectors  u,  u,  .  .  . ,  u  hy  ±  1,  we  can 
make  all  the  minors  of  (129)  positive: 


I 


U\  •■      '  "   >0  -  ^   -  -   ^  ''-      .  (130) 


•"•3  The  matrix  211  is  the  p-th  compound  matrix  A^i  (see  Vol.  I,  Chapter  1,  p.  20.) 
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The  fundamental  matrix  U  =  11  uik  11 1  is  connected  with  A  by  the  equation 

^  =  t/{Ai,A2,  ...,A„}C/-i.  (131) 

But  then 

A-'={U-')-^?i„;i„,..,?ijU\  (132) 

Comparing  (131)  with  (132),  we  see  that 

7=(C7T)-i  (133) 

is  the  fundamental  matrix  of  A"^  with  the  same  characteristic  values  Ai,  A2, 
.  .  . ,  An-  But  since  A  is  oscillatory,  so  is  A""".  Therefore  in  V  as  well  for 
every  p  =  1,  2,  .  .  . ,  w  all  the  minors 


F     '      '***    n       (l^^l<^2<•.•<^;^7l)  (134) 

\1      2  ...  p I 

are  different  from  zero  and  are  of  the  same  sign. 

On  the  other  hand,  by  (133)  U  and  V  are  connected  by  the  equation 

U''V=E. 

Going  over  to  the  p-th.  compound  matrices  (see  Vol.  I,  Chapter  I,  §  4),  we 
have: 

Hence,  in  particular,  noting  that  the  diagonal  elements  of  @p  are  1,  we  obtain  : 

l^h<ii<' ■ '  <ip^n        \1        2    ...pj        \l        2     ...pj 

On  the  left-hand  side  of  this  equation,  the  first  factor  in  each  of  the  sum- 
mands  is  positive  and  the  second  factors  are  different  from  zero  and  are  of 
like  sign.    It  is  then  obvious  that  the  second  factors  as  well  are  positive ;  i.e., 

U      2   ...pj  \        p=l,2,...,n        I 

Thus,   the  inequalities    (130)    and    (136)    hold   for   U=\\uit\\t    and 
V=  (IT"")-'  simultaneously. 
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When  we  express  the  minors  of  V  in  terms  of  those  of  the  inverse  matrix 
y-i  =  ZJ"*"  by  the  well-known  formulas  (see  Vol.  I,  pp.  21-22),  we  obtain 

p 

\l2..,n-'pj  \U\  \nn-l  ...n-v^lj 

where  ii<i2<  -  -  ■  <ip  and  h  <  J2<  --  -  <  jn-p  together  give  the  com- 
plete system  of  indices  l,2,...,n.  Since,  by  (130),  |  Z7  |  >  0  it  follows  from 
(136)  and  (137)  that 

^       ^  \l   2  ...p)  \         p  =  l,2,...,n         I 

h  h 

Now  let  u=  ^  CkU  (  ^  c|  >  0).    We  shall  show  that  the  inequalities 

k=g  k=g 

(130)  imply  the  second  part  of  (128)  : 

S^^h-l,  (139) 

and  the  inequalities  (138),  the  first  part: 

S-^g-l.  (140) 

Suppose  that  S+  >  h  —  1.    Then  we  can  find  h  +  1  coordinates  of  u 

Ui^,  Ui^,  . . .,  WiA+i       (1  ^  *i  <  *2  <  •  •  •  <  h+i  ^  ^)  (141) 

such  that 

Furthermore,  the  coordinates  ( 141 )  cannot  all  be  zero ;  for  then  we  could 

h 

equate  the  corresponding  coordinates  of  the  vector  u  =  ^  c^u   (ci  =  . . .  — 

*  A:=l 

c^_i  =  0;  ^ c\  >  0)  to  zero  and  thus  obtain  a  system  of  homogeneous 
equations 

h 

2Jcj^Ui^j^  =  0       {(x  =  l,  2,  ...,h) 
k=i 

with  the  non-zero  solution  Ci,  02,  .  . . ,  Ch,  whereas  the  determinant  of  the 
system 

\l  2  ...  h) 
is  different  from  zero,  by  (130). 
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•    Uuh 


Uuh  Ui 


^*A+1^    •  •  •    ^^?i+l*    ^^+1 


=  0. 


We  expand  it  with  respect  to  the  elements  of  the  last  column : 

^^\_i)»+«+i„,„  vi^^--- '-'  ''^^  ■  ■  ■  Y')  =0. 

But  such  an  equation  cannot  hold,  since  on  the  left-hand  side  all  the  terms 
are  of  like  sign  and  at  least  one  term  is  different  from  zero.  Hence  the 
assumption  that  ;S^+  >  /i  —  1  has  led  to  a  contradiction,  and  (139)  can  be 
regarded  as  proved. 

We  consider  the  vector 


where 


^*=  Wk>  %*'  •  •  •»  Kk)         {k  =  l,2,  ...,  n), 

<  =  (-  1)"+*+Xa:  {i,h=l,2,...,n); 

then  for  the  matrix  Z7*  =  11  tijfc  11 J  we  have,  by  (138)  : 

\7i  71  — 1  ...  w  — ^  +  1/  \  p=l,2,...,n  / 

But  the  inequalities  (142)  are  analogous  to  (130).     Therefore,  by  setting 

^^^i'HD'c.i:  (143) 

k=g 

we  have  the  inequality  analogous  to  (139)  :^* 

B^.^n-g.  (144) 

Let  w.  =  (w-i,  1^.2, ...  ?  Wn)  and  u*  =  (uu  U2,  . . . ,  u*n).    It  is  easy  to  see  that 

u*=(-lYu,  (1  =  1,2,  ...,n). 

Therefore 


54  In  the  inequalities  (142),  the  vectors  i^  (fc  =  1,  2,  . . . ,  n)  occur  in  the  inverse  order 
n  n — 1  ff 

u,  u, The  vector  u  is  preceded  by  n  —  g  vectors  of  this  kind. 
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and  so  the  relation  (140)  holds,  by  (144). 

This  establishes  the  inequality  (128).  Since  the  second  statement  of  the 
theorem  is  obtained  from  (128)  by  setting  g  =  h  =  k,  the  theorem  is  now 
completely  proved. 

3.  As  an  application  of  this  theorem,  let  us  study  the  small  oscillations  of 
n  masses  mi,  m2, .  .  .  ,mn  concentrated  at  n  movable  points  Xi  <i  X2  <.  .  .  .  K  ^n 
of  a  segmentary  elastic  continuum  (a  string  or  a  rod  of  finite  length), 
stretched  (in  a  state  of  equilibrium)  along  the  segment  Q^x^l  of  the 
X-axis. 

We  denote  hj  K{x,s)  (0  ^  x,  s  ^  I)  the  function  of  influence  of  this 
continuum  (K(x,s)  is  the  displacement  at  the  point  x  under  the  action  of  a 
unit  force  applied  at  the  point  s)  and  by  kij  the  coefficients  of  influence  for 
the  given  n  masses : 

^i^=K  (x.,  X.)       {i,j  =  l,2,.. .,  n) . 

If  at  the  points  Xi,  x^,  .  . . ,  Xn  n  forces  F^,  F2,  . . . ,  Fn  are  applied,  then 
the  corresponding  static  displacement  y{x)  {0'^x'^l),\^  given,  by  virtue 
of  the  linear  superposition  of  displacements,  by  the  formula 


y(x)  =  ZK{x,Xj)F., 


1=1 

When  we  here  replace  the  forces  Fj  by  the  inertial  forces  -  m^  -^  y  {xp  t) 
(i  =  1,  2,  .  .  . ,  n) ,  we  obtain  the  equation  of  free  oscillations 

y(x)  =  —^  rrijK  {x,  Xj)  ^  y  (xj,  t) .  (145) 

We  shall  seek  harmonic  oscillations  of  the  continuum  in  the  form 

y{x)=u{x)sm{(jot  +  <x)     {O^x^l),  (146) 

Here  u{x)  is  the  amplitude  function,  co  the  frequency,  and  a  the  initial 
phase.  Substituting  this  expression  for  y{x)  in  (145)  and  cancelling 
sin  {cot  +  a),  we  obtain 

n 

u  {x)  =  co^  ^  nijK  {x,  Xj)  u  (xj) .  (147) 

7=1 
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Let  us  introduce  a  notation  for  the  variable  displacements  and  the  dis- 
placements in  amplitude  at  the  points  of  distribution  of  mass : 

Vi  =y{Xv  0  ,     Ui=  u  {x,)       (i  =  1,  2,  . . . ,  n). 

Then 

2/i  =  1^,.  sin  (ft>^  +  a)       (i  =  1,  2, . . . ,  w) . 

We  also  introduce  the  reduced  amplitude  displacements  and  the  reduced 
coefficients  of  influence 

Ui=  irriiUi ,     ttij  =  iniim^kij       {i,  j  =1,  2,  . . . ,  n).  (148) 

Replacing  x  in  (147)  hj  Xi  {i  =  l,  2,  .  .  . ,  n)  successively,  v^^e  obtain  a 
system  of  equations  for  the  amplitude  displacements : 


2JaijU^  =  2.Ui      (a  =  ^;     *  =  1,2,  ...,wj. 


(149) 


Hence  it  is  clear  that  the  amplitude  vector  u  =  (w^,  U2,  .  .  .,  ^„)  is  a  charac- 
teristic vector  of  A  =  II  a^^  \\l  =  \\  Y^^^j^ij  jj"  for  A  =  l/co^  (see  Vol.  I,  Chapter 
X,§8). 

It  can  be  established,  as  the  result  of  a  detailed  analysis,^^  that  the  matrix 
of  the  coefficients  of  influence  ||  A^iy  ||i  of  a  segmentary  continuum  is  always 
oscillatory.  But  then  the  matrix  J.  =  ||  ^i,  jji  —  ||  yi^i^j^ij  jli  is  also  oscilla- 
tory!   Therefore  (by  Theorem  13)  A  has  n  positive  characteristic  values 

Ai>A2>--->A„>0; 

i.e.,  there  exist  n  harmonic  oscillations  of  the  continuum  with   distinct 
frequencies : 


(0 


<)wi<W2<---<w„       K  =  ^5     i  =  \,2,...,n\. 


By  the  same  theorem  to  the  fundamental  frequency  cdi  there  correspond 
amplitude  displacements  different  from  zero  and  of  like  sign.  Among  the 
displacements  in  amplitude  corresponding  to  the  first  overtone  with  the 
frequency  0)2  there  is  exactly  one  variation  of  sign  and,  in  general,  among 
the  displacements  in  amplitude  for  the  overtone  with  the  frequency  co_,  there 
are  exactly  j  —  1  variations  of  sign  ( j  =  1,  2,  . .  . ,  n). 


55  See  [239],  [240],  and  [17],  Chapter  III. 
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From  the  fact  that  the  matrix  of  the  coefficients  of  influence  11  k^  11? 
is  oscillatory  there  follow  other  oscillatory  properties  of  the  continuum : 
1)  For  co  =  coi  the  amplitude  function  u(x),  which  is  connected  with  the 
amplitude  displacements  by  (147),  has  no  nodes ;  and,  in  general,  for  co  =  coj 
the  function  has  j  —  1  nodes  (j  =  l,2,...,n);2)  The  nodes  of  two  adjacent 
harmonics  alternate,  etc. 

We  cannot  dwell  here  on  the  justification  of  these  properties.^® 


56  See  [17],  Chapters  III  and  IV. 


CHAPTER   XIV 

APPLICATIONS  OF  THE  THEORY  OF  MATRICES 

TO  THE  INVESTIGATION  OF  SYSTEMS  OF 

LINEAR  DIFFERENTIAL  EQUATIONS 

§  1.    Systems  of  Linear  Differential  Equations  with  Variable 
Coefficients.     General  Concepts 

1.    Suppose  given  a  system  of  linear  homogeneous  differential  equations  of 
the  first  order : 

^'=i'P.*(«)^*      {i=l,2,...,n),  (1) 

where  Pik(i)  (i,k  =  l,2,...,n)  are  complex  functions  of  a  real  argument  t, 
continuous  in  some  interval,  finite  or  infinite,  of  the  variable  t.^ 

Setting  P(t)  =  II  pijc(t)  p    and  x=  {xi,  X2,  . . . ,  Xn),  we  write  (1)  as 

%  =  P(t)..  (2) 

An  integral  matrix  of  the  system  (1)  shall  be  defined  as  a  square  matrix 
X{t)  =  II  Xijc(t)  II 1  whose  columns  are  n  linearly  independent  solutions  of 
the  system. 

Since  every  column  of  X  satisfies  (2),  the  integral  matrix  X  satisfies  the 
equation 

§  =  P(t)X.  (3) 

In  what  follows,  we  shall  consider  the  matrix  equation  (3)  instead  of 
the  system  (1). 

From  the  theorem  on  the  existence  and  uniqueness  of  the  solution  of  a 
system  of  differential  equations^  it  follows  that  the  integral  matrix  X(t) 
is  uniquely  determined  when  the  value  of  the  matrix  for  some  ('initial') 


1  In  this  section,  all  the  relations  that  involve  functions  of  t  refer  to  the  given  interval. 

2  A  proof  of  this  theorem  will  be  given  in  §  5.  See  also  I.  G.  Petrowski  (Petrovskii), 
Vorlesungen  uber  die  Theorie  der  gewbhnlichen  Biff erentialgleichungen,  Leipzig,  1954 
(translated  from  the  Russian:  Moscow,  1952). 
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value  t  =  tois  known,^  X{to)  =  Xq.  For  Xj  we  can  take  an  arbitrary  non- 
singular  square  matrix  of  order  n.  In  the  particular  case  where  X(to)  =E, 
the  integral  matrix  X{t)  will  be  called  normalized. 

Let  us  differentiate  the  determinant  of  X  by  differentiating  its  rows  in 
succession  and  let  us  then  use  the  differential  relations 

dx-         "^ 

-^  =2JPik^kj      {i,j  =  l,2,  ...,  n). 


We  obtain : 


^=(Pn  +  P.2  +  "-  +  Pj|^ 


Hence  there  follows  the  well-known  Jacohi  identity 

t 

/  tr  Pdt 

\X\  =  ce'o  ,  (4) 

where  c  is  a  constant  and 

tr  P  =  pii  +  P22  +  .  .  .  +  Pnn 
is  the  trace  oiP{t). 

Since  the  determinant  |  X  \  cannot  vanish  identically,  we  have  c  7^  0. 
But  then  it  follows  from  the  Jacobi  identity  that  |  X  |  is  different  from  zero 
for  every  value  of  the  argument 

i.e.,  an  integral  matrix  is  non-singular  for  every  value  of  the  argument. 

If  X(^)  is  a  non-singular  (\  X(t)  \  y^O)  particular  solution  of  (3),  then 
the  general  solution  is  determined  by  the  formula 

X  =  XC,  (5) 

where  C  is  an  arbitrary  constant  matrix. 

For,  by  multiplying  both  sides  of  the  equation 

§  =  PX  (6) 

by  C  on  the  right,  we  see  that  the  matrix  XC  also  satisfies  (3) .    On  the  other 
hand,  if  X  is  an  arbitrary  solution  of  (3),  then  (6)  implies: 


It  is  assumed  that  to  belongs  to  the  given  interval  of  t. 
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and  hence  by  (3) 

^(l-iX)=0 

and 

X-^X  =- const  =C; 

i.e.,  (5)  holds. 

All  the  integral  matrices  X  of  the  system  ( 1 )  are  obtained  by  the  formula 
(5)  with  I  0|  v^O. 

2.    Let  us  consider  the  special  case : 

f=^X,  (7) 

where  A  is  a  constant  matrix.    Here  X  =  e^^  is  a  particular  non-singular 
solution  of  (7),^  so  that  the  general  solution  is  of  the  form 

X  =  e^'C  (8) 

where  C  is  an  arbitrary  constant  matrix. 

Setting  t  =  tom  (8)  we  find :  Xo  =  e^*o(7.  Hence  C  =  c-^'o  X^  and  there- 
fore (8)  can  be  represented  in  the  form 

X  =  e^^'-'o)XQ.  (9) 

This  formula  is  equivalent  to  our  earlier  formula  (46)  of  Chapter  V  (Vol.  I, 
p.  118). 

Let  us  now  consider  the  so-called  Cauchy  system : 

dX         A 

—-  =- X       (A  is  a  constant  matrix).  (10) 

dt       t  —  (I 

This  case  reduces  to  the  preceding  one  by  a  change  of  argument : 

u  =  ]n{t  —  a). 

Therefore  the  general  solution  of  ( 10 )  looks  as  follows : 

X=e^ln(«-a)C'^(^_^)4C  (11) 

The  functions  e^*  and  (t  —  a)^  that  occur  in  (8)  and  (11)  may  be  repre- 
sented in  the  form  (Vol.  I,  p.  117) 


-*  By  term-by-term  differentiation  of  the  series  e^*^  27  tt  ^  we  find  — -c^'  =  Ae^t 


*-0*'  ^' 
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e^'  =  j;  (Z,i  +  Z,2^  +  . . .  +  Z^    n  -1)  c^*^ ,  (12) 

s 

(^_a)^  =  j;(Z^,  +  Z,2ln(«-a)+...+Z,     [ln(f-a)n-i)(^-a)^*.      (13) 

Here 

xp  (A)  =  (A  -  Ai)"^^  (A  -  Ag)-' ...  (A  -  A,)"*^ 
(^  =7^  -^fc  for  i  7^  A: ;  i,  fc  =  1,  2, . . . ,  s) 

is  the  minimal  polynomial  of  A,  and  Z^y  (^  =  1,  2,  . . . ,  m^ ;  A:  =  1,  2,  . . . ,  s) 
are  linearly  independent  constant  matrices  that  are  polynomials  in  A.^ 

Note.  Sometimes  an  integral  matrix  of  the  system  of  differential  equa- 
tions (1)  is  taken  to  be  a  matrix  W  in  which  the  rows  are  linearly  independ- 
ent solutions  of  the  system.    It  is  obvious  that  W  is  the  transpose  of  X : 

W  =  X^. 

When  we  go  over  to  the  transposed  matrices  on  both  sides  of  (3),  we 
obtain  instead  of  ( 3 )  the  following  equation  for  W : 

f  =  TfP(0.  (3') 

Here  W  is  the  first  factor  on  the  right-hand  side,  not  the  second,  as  X  was 
in  (3). 


§  2.    Lyapunov  Transformations 

1.  Let  us  now  assume  that  in  the  system  (1)  (and  in  the  equation  (3)) 
the  coefficient  matrix  Pit)  =  \\  Piuit)  11 1  is  a  continuous  bounded  function 
of  t  in  the  interval  [^o,  <^)-^ 

In  place  of  the  unknown  functions  Xi,  X2,  •  • . ,  Xn  we  introduce  the  new 
unknown  functions  y\,  y2,  -  •  ■ ,  Vn  by  means  of  the  transformation 

n 

^i=2:iiMyk       {i=l,2,...,n).  (14) 


5  Every  term  Xh  =  {Zj^^  +  Zj^^t  -\ +  Zi-mi^*"*-^)  «^*'   (fc  =  1,  2,  .  . . ,  s)        on  the 

right-hand  side  of  (12)  is  a  solution  of  (7).  For  the  product  g{A)e^*^,  with  an  arbitrary- 
function  gi'k),  satisfies  this  equation.  But  X^^f{A)^g{A)eAt\tf{\)  =z5r(X)e^^and 
^(X")  =1,  and  all  the  remaining  m — 1  values  of  gO.)  on  the  spectrum  of  A  are  zero 
(see  Vol.  I,  Chapter  V,  formula  (17),  on  p.  104). 

^  This  means  that  each  function  pik(i)  (i,  Tc  =  l,  2,  .  . .  ,  n)  is  continuous  and  bounded 
in  the  interval  [to,  oo  ) ,  i.e.,  t  ^  to. 
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We  impose  the  following  restrictions  on  the  matrix  L{t)  =  ||  kkit)  ||i 
of  the  transformation : 

1.  L(t)  has  a  continuous  derivative  ^  in  the  interval  [^o,  oo )  ; 

2.  Lit)  and  ^-  are  bounded  in  the  interval  [#o,  «> )  ; 

3.  There  exists  a  constant  m  such  that 

0  <  m  <  absolute  value  oi  \  L{t)  \  (f  ^  ^q)  , 

i.e.,  the  determinant  \  L{t)  \  is  bounded  in  modulus  from  below  by  the  posi- 
tive constant  m. 

A  transformation  (14)  in  which  the  coefficient  matrix  L(#)  =  ||  kk{t)  p 
satisfies  1.-3.  will  be  called  a  Lyapunov  trams  formation  and  the  correspond- 
ing matrix  L{t)  a,  Lyapunov  matrix. 

Such  transformations  were  investigated  by  A.  M.  Lyapunov  in  his 
famous  memoir  'The  General  Problem  of  Stability  of  Motion'  [32]. 

Examples.  1.  If  L  =  const,  and  |  L  |  7^  0,  then  L  satisfies  the  conditions 
1.-3.  Therefore  a  non-singular  transformation  with  constant  coefficients  is 
always  a  Lyapunov  transformation. 

2.  If  Z)=  II  dijc  II 1  is  a  matrix  of  simple  structure  with  pure  imaginary 
characteristic  values,  then  the  matrix 

L{t)=eD' 

satisfies  the  conditions  1.-3.  and  is  therefore  a  Lyapunov  matrix.^ 

2.  It  is  easy  to  verify  that  the  conditions  1.-3.  of  a  matrix  L{t)  imply  the 
existence  of  the  inverse  matrix  L~^(f)  also  satisfying  the  conditions  1.-3.; 
i.e.,  the  inverse  of  a  Lyapunov  transformation  is  itself  a  Lyapunov  trans- 
formation. In  the  same  way  it  can  be  verified  that  two  Lyapunov  transfor- 
mations in  succession  yield  a  Lyapunov  transformation.  Thus,  the  Lyapunov 
transformations  form  a  group.    They  have  the  following  important  property : 

If  under  the  transformation  (14)  the  system  (1)  goes  over  into 

#=i'fe(0  2/*  (15) 


*=! 


and  if  the  zero  solution  of  this  system  is  stable,  asymptotically  stable,  or 
unstable  in  the  sense  of  Lyapunov  (see  Vol.  I,  Chapter  V,  §  6),  then  the  zero 
solution  of  the  origin^  system  (1)  ha^  the  same  property. 


^  Here  all  the  m*  =  1  in  (12)  and  Xk  =  i^)k{<Pk  real,  fc  =  1,  2,  . . . ,  5). 
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In  other  words,  Lyapunov  transformations  do  not  alter  the  character 
of  the  zero  solution  (as  regards  stability).  This  is  the  reason  why  these 
transformations  can  be  used  in  the  investigation  of  stability  in  order  to 
simplify  the  original  system  of  equations. 

A  Lyapunov  transformation  establishes  a  one-to-one  correspondence  be- 
tween the  solutions  of  the  systems  (1)  and  (15)  ;  moreover,  linearly  inde- 
pendent solutions  remain  so  after  the  transformation.  Therefore  a  Lyapunov 
transformation  carries  an  integral  matrix  X  of  (1)  into  some  integral 
matrix  Y  of  (15)  such  that 

X  =  L(t)Y.  (16) 

In  matrix  notation,  the  system  (15)  has  the  form 

%  =  Q(t)Y,  (17) 

where  Q{t)  =  11  qncit)  11?  is  the  coefficient  matrix  of  (15). 

Substituting  LY  for  X  in  (3)  and  comparing  the  equation  so  obtained 
with  (17),  we  easily  find  the  following  formula  which  expresses  Q  in  terms 

of  P  and  L : 

dL 
dt 


Q=L-^PL-L-^"^,  (18) 


Two  systems  (1)  and  (15)  or,  what  is  the  same,  (3)  and  (17)  will  be 
called  equivalent  (in  the  sense  of  Lyapunov)  if  they  can  be  carried  into  one 
another  by  a  Lyapunov  transformation.  The  coefficient  matrices  P  and  Q 
of  equivalent  systems  are  always  connected  by  the  formula  (18)  in  which 
L  satisfies  the  conditions  1.-3. 

§  3.    Reducible  Systems 

1.  Among  the  systems  of  linear  differential  equations  of  the  first  order  the 
simplest  and  best  known  are  those  with  constant  coefficients.  It  is,  there- 
fore, of  interest  to  study  systems  that  can  be  carried  by  a  Lyapunov  trans- 
formation into  systems  with  constant  coefficients.  Lyapunov  has  called  such 
systems  reducible. 

Suppose  given  a  reducible  system 

f=PX  (19. 

Then  some  Lyapunov  transformation 

X=L{t)Y  (20) 

carries  it  into  a  system 
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'i,=AY,  (21) 

where  A  is  a  constant  matrix.    Therefore  (19)  has  the  particular  solution 

Z==L(Oe^^  (22) 

It  is  easy  to  see  that,  conversely,  every  system  (19)  with  a  particular  solu- 
tion of  the  form  (22),  where  L{i)  is  a  Lyapunov  matrix  and  A  a  constant 
matrix,  is  reducible  and  is  reduced  to  the  form  (21)  by  means  of  the  Lyapu- 
nov transformation  (20). 

Following  Lyapunov,  we  shall  show  that:  Every  system  (19)  with 
periodic  coefficients  is  reducible.^ 

het  P{t)  in  (19)  be  a  continuous  function  in  ( —  oo,  +  oo  )  with  period  r: 

P{t  +  r)=Pit).  (23) 

Replacing  t  in  (19)  by  #  +  r  and  using  (23),  we  obtain : 

Thus,  Xit  +  r)  is  an  integral  matrix  of  (19)  ii  X{t)  is.^  Therefore 

X(t  +  T)=X{t)V, 

where  F  is  a  constant  non-singular  matrix.  Since  |  "T  |  ^^  0,  we  can 
determine^ 

rr-  ~'-^ 

V^=e^ 

This  matrix  function  of  t,  just  like  X(t),  is  multiplied  on  the  right  by  V 
when  the  argument  is  increased  by  r.    Therefore  the  'quotient' 

L{t)=  X{t)V~^  =X{t)e~'^^''^ 

is  .continuous  and  periodic  with  period  r: 

L{t  +  r)=L{t), 

and  with  \L\=^0.  The  matrix  L{t)  satisfies  the  conditions  1.-3.  of  the 
preceding  section  and  is  therefore  a  Lyapunov  matrix. 


8  See  [32],  §47. 

^Here  lnF  =  /(F),  where  /(X)  is  any  single-valued  branch  of  In  X  in  the  simply- 
connected  domain  G  containing  all  the  characteristic  values  of  V,  but  not  containing  0. 
See  Vol.  I,  Chapter  V. 
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On  the  other  hand,  since  the  solution  X  of  (19)  can  be  represented  in 
the  form 

InF 

X=L{t)e  ^   \ 

the  system  (19)  is  reducible. 

In  this  case  the  Lyapunov  transformation 

X  =  L{t)Y, 
which  carries  (19)  into  the  form 

at        T 

has  periodic  coefficients  with  period  t. 

Lyapunov  has  established^^  a  very  important  criterion  for  stability  and 
instability  of  a  first  linear  approximation  to  a  non-linear  system  of  differ- 
ential equations 

^  =  2:<^i,^,  +  i*^)          (i  =h  2,...,n),  (24) 

where  we  have  convergent  power  series  in  Xi,  X2,  . . . ,  Xn  on  the  right-hand 
side  and  where  (**)  denotes  the  sum  of  the  terms  of  second  and  higher  orders 
in  Xi,  X2,  .  .  . ,  Xn\  the  coefficients  ct^fc  (i,  A;  =  1,  2,  .  .  .  ,  w)  of  the  linear  terms 
are  constant.^^ 

Lyapunov's  Criterion:  The  zero  solution  of  (24)  is  stable  {and  even 
asymptotically  stable)  if  all  the  characteristic  values  of  the  coefficient  matrix 
A=\\  Oik  III  of  the  first  linear  approximation  have  negative  real  parts,  and 
unstable  if  at  least  one  characteristic  value  has  a  positive  real  part. 

2.  The  arguments  used  above  enable  us  to  apply  this  criterion  to  a  system 
whose  linear  terms  have  periodic  coefficients : 

dx-         " 

W=^ft*W=^*  +  M-  (25) 

k=l 

For  on  the  basis  of  the  preceding  arguments  we  reduce  the  system  (25)  to 
the  form  (24)  by  means  of  a  Lyapunov  transformation,  where 


10  See  [32],  §24. 

11  The  coefficients  in  the  non-linear  terms  may  depend  on  t.    These  functional  coeffi- 
cients are  subject  to  certain  restrictions  (see  [32],  §  11). 


§4.  Canonical  Form  OF  Reducible  System.  Erugin 's  Theorem    121 

and  where  V  is  the  constant  matrix  by  which  an  integral  matrix  of  the  cor- 
responding linear  system  (19)  is  multiplied  when  the  argument  is  changed 
by  r.  Without  loss  of  generality,  we  may  assume  that  t  >  0.  By  the  prop- 
erties of  Lyapunov  transformations  the  zero  solutions  of  the  original  and  of 
the  transformed  systems  are  simultaneously  stable,  asymptotically  stable, 
or  unstable.  But  the  characteristic  values  Xi  and  Vi  {i=l,  2,  .  .  .  ,  n)  of.  A 
and  V  are  connected  by  the  formula 

A,=  iln^',     {i  =  l,2,...,n). 

Therefore,  by  applying  Lyapunov 's  criterion  to  the  reduced  systems  we 
find:^2 

The  zero  solution  of  (25)  is  asymptotically  stable  if  all  the  characteristic 
values  VI,  V2,  '  ■  • ,  vn  of  V  are  of  modulus  less  than  1  and  unstable  if  at  least 
one  characteristic  value  is  of  modulus  greater  than  1. 

Lyapunov  has  established  his  criterion  for  the  stability  of  a  linear  ap- 
proximation for  a  considerably  wider  class  of  systems,  namely  those  of  the 
form  (24)  in  which  the  linear  approximation  is  not  necessarily  a  system  with 
constant  coefficients,  but  belongs  to  a  class  of  systems  that  he  has  called 
regular.^^ 

The  class  of  regular  linear  systems  contains  all  the  reducible  systems. 

A  criterion  for  instability  in  the  case  when  the  first  linear  approxima- 
tion is  a  regular  system  was  set  up  by  N.  G.  Chetaev.^^ 


§  4.    The  Canonical  Form  of  a  Reducible  System.     Erugin's  Theorem 

1.    Suppose  that  a  reducible  system  (19)  and  an  equivalent  system 

(in  the  sense  of  Lyapunov)  are  given,  where  A  is  a  constant  matrix. 

We  shall  be  interested  in  the  question :  To  what  extent  is  the  matrix  A 
determined  by  the  given  system.  (19)  ?  This  question  can  also  be  formu- 
lated as  follows : 


^2  Loc.  cit.,  §  55. 
13  Loc.  cit.,  §  9. 
i*See  [9],  p.  181. 


122   XIV.  Applications  to  Systems  of  Linear  Differential  Equations 
When  are  two  systems 

%=AY    and     §=BZ, 

where  A  and  B  are  constant  matrices,  equivalent  in  the  sense  of  Lyapunov; 
i.e.,  when  can  they  he  carried  into  one  another  hy  a  Lyapunov  transfor- 
mation f 

In  order  to  answer  this  question  we  introduce  the  notion  of  matrices  with 
one  and  the  same  real  part  of  the  spectrum. 

We  shall  say  that  two  matrices  A  and  B  of  order  n  have  one  and  the  same 
real  part  of  the  spectrum  if  and  only  if  the  elementary  divisors  of  A  and  B 
are  of  the  form 

(A-Airs  (A-Aa)-*, . . .,  a-A,r-  (A-^irs  {x-ix^)^, . . .,  a-z^.r, 

where 

"Rq Xjc=^'Rq fijc     (A;  =  l,  2,  . . .,  s). 

Then  the  following  theorem  due  to  N.  P.  Erugin  holds  :^^ 
Theorem  1  (Erugin) :    Two  systems 

%  =  AT    and     f  =  5Z  (26) 

{A  and  B  a/re  constant  matrices  of  order  n)  are  equivalent  in  the  sense  of 
Lyapunov  if  and  only  if  the  matrices  A  and  B  have  one  and  the  same  real 
part  of  the  spectrum. 

Proof.  Suppose  that  the  systems  (26)  are  given.  We  reduce  A  to  the 
normal  Jordan  form^^  (see  Vol.  I,  Chapter  VI,  §  7) 

A  =  T{  Ai^i  +  ^1,  hE2  +  ^2,  .  •  :  KEs  +  H,}T-\  (27) 

where 

Afc  =  Ofc  +  iPjc         (ajc,  h  ^^^  real  numbers ;  A;  =  1,  2, . . . ,  s).         (28) 

In  accordance  with  (27)  and  (28)  we  set 


^5  Our  proof  of  the  theorem  differs  from  that  of  Erugin. 

^®  Ek  is  the  unit  matrix;  in  Ilk  the  elements  of  the  first  superdiagonal  are  1,  and  the 
remaining  elements  are  zero;  the  orders  of  Ek,  Hk  are  the  degrees  of  the  fc-th  elementary- 
divisor  of  A,  i.e.,  Wfc  (Tc=l,  2,  . . . ,  s). 
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Then 

A=Ai  +  A^,     AiA^  =  A^Ai.  (30) 

We  define  a  matrix  L(t)  by  the  equation 

L(t)  is  a  Lyapunov  matrix  (see  Example  2  on  p.  IIT). 

But  by  (30)  a  particular  solution  of  the  first  of  the  systems  (26)  is  of 
the  form 

Hence  it  follows  that  the  first  of  the  systems  (26)  is  equivalent  to 

'^=A,U,  (31) 

where,  by  (29),  the  matrix  Ai  has  real  characteristic  values  and  its  spec- 
trum coincides  with  the  real  part  of  the  spectrum  of  A. 

Similarly,  we  replace  the  second  of  the  systems  (26)  by  the  equivalent 
system 

%  =  B,V,  (32) 

where  the  matrix  Bi  has  real  characteristic  values  and  its  spectrum  coincides 
with  the  real  part  of  the  spectrum  of  B. 

Our  theorem  will  be  proved  if  we  can  show  that  the  two  systems  (31) 
and  (32)  in  which  Ai  and  Bi  are  constant  matrices  with  real  characteristic 
values  are  equivalent  if  and  only  if  Ax  and  Bi  are  similar}"^ 

Suppose  that  the  Lyapunov  transformation 

carries  (31)  into  (32).    Then  the  matrix  Li  satisfies  the  equation 

"li^^iii-iA.  (33) 

This  matrix  equation  for  Li  is  equivalent  to  a  system  of  n^  differential 
equations  in  the  n^  elements  of  Li.  The  right-hand  side  of  (33)  is  a  linear 
operation  on  the  Sector'  Li  in  an  ti^-dimensional  space 


'^'^  This  proposition  implies  Theorem  1,  since  the  equivalence  of  the  systems  (31)  and 
(32)  means  that  the  systems  (26)  are  equivalent,  and  the  similarity  of  Ai  and  Bi  means 
that  these  matrices  have  the  same  elementary  divisors,  so  that  the  matrices  A  and  B  have 
one  and  the  ;same  real  part  of  the  spectrum. 
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^=F(ii),      IF(L^)=A^L^-L^BJ.  (33') 

Every  characteristic  value  of  the  linear  operator  F  (and  of  the  corre- 
sponding matrix  of  order  n^)  can  be  represented  in  the  form  of  a  difference 
y  —  d,  where  y  is  a  characteristic  value  of  Ai  and  d  a  characteristic  value 
of  Bi}^  Hence  it  follows  that  the  operator  F  has  only  real  characteristic 
values. 

We  denote  by 

(the  jij.  are  real ;  /i^  j^  X^  for  i  ^  j ;  i,  j  =  1,  2,  .  .  .  ,  i^)  the  minimal  polynomial 
of  F.  Then  the  solution  L^{t)  =  ^'U^^ oi  (33')  can,  by  formula  (12) 
(p.  116),  be  written  as  follows: 

ii(0=^  ^i««'e^*'.  (34) 

where  the  L^j  are  constant  matrices  of  order  n.  Since  the  matrix  Li{t)  is 
bounded  in  the  interval  (^o,  oo),  both  for  every  .^a;  >  0  and  for  Afc  =  0  and 
j  >  0,  the  corresponding  matrices  Luj  =  0.  We  denote  by  L-.{t)  the  sum  of 
all  the  terms  in  (34)  for  which  2^  <  0.    Then 

L^{t)  =  L-{t)  +  Lo,  (35) 

where 

lim  L-{t)  =  0,     lim   ^^^  =  0,     Lo=const.  (35') 

Then,  by  (35)  and  (35'), 

lim  Li{t)  =  LQ, 

^®  For  let  Ao  be  any  characteristic  value  of  the  operator  F.    Then  there  exists  a  matrix 
Ly^O  such  that  F(L)  =  AoL,  or 

{A^  —  AoE)L  =  LBr.  (*) 

The  matrices  ^i  —  AoE  and  Bx  have  at  least  one  characteristic  value  in  common,  since 
otherwise  there  would  exist  a  polynomial  gi'K)  such  that 

g(A,  —  AoE)  =  0,g(Bi)  =E, 

and  this  is  impossible,  because  it  follows  from  (*)  that  g(Ai  —  AoE)  •  L  =  L  •  g(Bi) 
and  L  ^  0.  But  if  A^.  —  AoE  and  B^.  have  a  common  characteristic  value,  then  Ao=  y —  6, 
where  y  and  3  are  characteristic  values  of  Ai  and  Bi,  respectively.  A  detailed  study  of 
the  operator  F  can  be  found  in  the  paper  [179]  by  F.  Golubchikov . 
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from  which  it  follows  that 

\Lo\^0, 

because  the  determinant  \  Li{t)  \  is  bounded  in  modulus  from  below. 

When  we  substitute  for  Li(t)  in  (33)  the  sum  L-{t)  +  Lq,  we  obtain: 

^%^  -  A^L^  {t)  +  B^L-  {t)  =  A^L^  -  B^L^ ; 

hence  by  (35') 

A^Lq-LqB^  =  0 
and  therefore 

B^=L-U^Lo,  (36) 

Conversely,  if  (36)  holds,  then  the  Lyapunov  transformation 

U=LoV 

carries  (31)  into  (32).     This  completes  the  proof  of  the  theorem. 

2.  From  this  theorem  it  follows  that:  Every  reducible  system  (19)  can  he 
carried  by  the  Lyapunov  transformation  X  —  LY  into  the  form 

dt  -*^^* 

where  J  is  a  Jordan  matrix  with  real  characteristic  values.  This  canonical 
form  of  the  system  is  uniquely  determined  by  the  given  matrix  P(t)  to 
within  the  order  of  the  diagonal  blocks  of  J. 
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1.    We  consider  a  system  of  differential  equations 

^  =  P{t)X,  (37) 


where  Pit)  =  I  pikit)  11 1   is  a  continuous  matrix  function  of  the  argument 
t  in  some  interval  (a,i).'^^ 


^9  (a,  &)  is  an  arbitrary  interval  (finite  or  infinite).  All  the  elements  Pik(t)  (i,  Tc-=. 
1,  2,  ...,  7i)  of  P(t)  are  complex  functions  of  the  real  argument  t,  continuous  in  (a,  &). 
Everything  that  follows  remains  valid  if,  instead  of  continuity,  we  require  (in  every  finite 
subinterval  of  (a,  fe))  only  boundedness  and  Riemann  integrability  of  all  the  functions 
Pi.(t). 
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We  use  the  method  of  successive  approximations  to  determine  a  normal- 
ized solution  of  (37),  i.e.,  a  solution  that  for  t  =  to  becomes  the  unit  matrix 
(^0  is  a  fixed  number  of  the  interval  (a,h)).  The  successive  approximations 
X]c  (/e  =  0,  1,  2,  .  .  .)  are  found  from  the  recurrence  relations 

^=P(«)X,_i         (k  =  l,2,...), 

when  Xo  is  taken  to  be  the  unit  matrix  E. 

Setting  X]c{to)  =^E  ik  =  0,  1,  2,  .  .  .)  we  may  represent  Xfc  in  the  form 

t 

to 

Thus 

t  t  t  T 

Xo  =  E,     X^=E+jP{T)dr,     X^  =  E -^ j P {r)dT -{-  j P {r) j P(o)dadr, . . . , 

to  to  to  to 

i.e.,  Xfc  {k  =  0,  1,  2,  .  .  .)  is  the  sum  of  the  first  k  +  1  terms  of  the  matrix 
series 

t  t  T 

E  +  j  P{r)dr  +  j  P{T)j  P{a)dadx  +  '" .  (38) 

to  to  to 

In  order  to  prove  that  this  series  is  absolutely  and  uniformly  convergent 
in  every  closed  subinterval  of  the  interval  ( o.^,  6 )  and  determines  the  required 
solution  of  (37),  we  construct  a  majorant. 

We  define  non-negative  functions  g{t)  and  }i{t)  in  {a,!))  by  the  equa- 
tions"" 

t 
gr(0=max[|2)ii(0|,  |2?i2(0|,  ...,  |2>„„(0|],     ^(t)  =\j  9  {i:)dr\. 

to 

It  is  easy  to  verify  that  g{i),  and  consequently  }i{t)  as  well,  is  continuous 
in  (a,  5). 21 

Each  of  the  n^  scalar  series  into  which  the  matrix  series  (38)  splits  is 
majorized  by  the  series 

l+,(,)  +  »4i9  +  ^  +  ....  (39) 


20  By  definition,  the  value  of  g{t)  for  any  value  of  t  is  the  largest  of  the  n"  moduli  of 
the  values  of  pik{t)  {i,  Tc  =  l,2,  .  .  . ,  n)  for  that  value  of  t. 

-1  The  continuity  of  g(t)  at  any  point  ti  of  the  interval  (a,  b)  follows  from  the  fact 
that  the  difference  g(t)  — giU)  for  t  sufficiently  near  ti  always  coincides  with  one  of 
the  n'  differences  |  pik{t)  \  —  \  Pik(U)  \  (i,  ]c  =  l,  2,  . . . ,  n). 


I 
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For 

t  t 

I  [fP  (T)  c^t),.,  I  =  I  I  Pi,  (T)  dr\^\    jg  (T)  dr\=h{t), 

to  to  to 

\{fP{r)fP{a)dadT)uk  I  =  I  Z  jVij[r)jv^,{a)dadT  \  ^n\  jg{T)jg{a)dadx\='^, 

to  to  /=l«o  <•  to  to 

etc. 

The  series  (39)  converges  in  {a,!))  and  converges  uniformly  in  every 
closed  part  of  this  interval.  Hence  it  follows  that  the  matrix  series  ( 38 )  also 
converges  in  {a,}))  and  does  so  absolutely  and  uniformly  in  every  closed 
interval  contained  in  {(1,1)). 

By  term-by-term  differentiation  we  verify  that  the  sum  of  (38)  is  a 
solution  of  (37);  this  solution  becomes  E  for  t  =  to.  The  term-by-term 
differentiation  of  (38)  is  permissible,  because  the  series  obtained  after  dif- 
ferentiation differs  from  (38)  by  the  factor  P  and  therefore,  like  (38),  is 
uniformly  convergent  in  every  closed  interval  contained  in  ( a,  6 ) . 

Thus  we  have  proved  the  theorem  on  the  existence  of  a  normal  solution 
of  (37).  This  solution  will  be  denoted  by  O't^iP)  or  simply  Q[^.  Every 
other  solution,  as  we  have  shown  in  §  1,  is  of  the  form 

where  C  is  an  arbitrary  constant  matrix.  From  this  formula  it  follows  that 
every  solution,  in  particular  the  normalized  one,  is  uniquely  determined  by 
its  value  for  t  =  to. 

This  normalized  solution  i^J  of  (37)  is  often  called  the  matricant. 

We  have  seen  that  the  matricant  can  be  represented  in  the  form  of  a 


QI  =  E+IP{t)  dr  +JP {r)j'P{a)  dadr+"',  (40) 

to  to  to 

which  converges  absolutely  and  uniformly  in  every  closed  interval  in  which 
P(t)'is  continuous. 

2.    We  mention  a  few  formulas  involving  the  matricant. 

1.  ^fo  =  ^l^ll     it^,t^,te{a,b)). 

For  since  Q^^  and  D^^  are  two  solutions  of  ( 37 ) ,  we  have 


"  The  representation  of  the  matricant  in  the  form  of  such  a  series  was  first  obtained 
by  Peano  [308]. 
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Q\^=Qt^C    (C  is  a  constant  matrix). 
Setting  t  =  ti  in  this  equation,  we  obtain  C  =  i3jj. 

2,      qUp+Q)=^Up)^Us)  with  s=[qUp)]-'qoUp)' 

To  derive  this  formula  we  set : 

X  =  Dl(P),     Y  =  Ql{P  +  Q), 
and 

Y=XZ.  (41) 

Differentiating  (41)  term  by  term,  we  find: 

{P  +  Q)XZ  =  PXZ  +  X~. 
Hence 

and  since  it  follows  from  (41)  that  Z(to)  ^E, 

Z=Q\{X-^QX). 

When  we  substitute  their  respective  matricants  for  X,  Y,  Z  in  (41),  we 
obtain  the  formula  2. 

t 

3.  \n\Q\,(P)\=j  irPdx. 

to 

This  formula  follows  from  the  Jacobi  identity  (4)    (p.  114)   when  we 
substitute  Ol^{P)  for  X(t)  in  that  identity. 

4.  If  A=\\  aiTcK  =  const.,  then 

We  introduce  the  following  notation.     If  P^=\\pik  lii ,  then  we  shall 
mean  by  mod  P  the  matrix 

modP=||  |p,,|  II?. 

Furthermore,  if  A  =  ||  Oik  ||i  and  B  =  ||  &ifc  jj"  are  two  real  matrices  and 

(hk^hik         {i,k  =  l,2,  .  .  .  ,n), 
then  we  shall  write 

A^B. 
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Then  it  follows  from  the  representation  (40)  that: 

5.  //  modP(0  ^modQ(t)  (t^to),  then  the  series  (40)  for  Q\^{P) 
is  majorized,  beginning  with  the  first  term,  by  the  same  series  for  Q\J^Q), 
so  that  for  all  t^to 

mod  Ql  (P)  ^  Ql  [Q) ,    mod  [<  (P)  -E]^  Q],  {Q)-E, 
t  t 

mod  {Q\^  (P)  -  E  -jPdT]  <  Q\,  [Q)  —  E  —JQdr,       etc . 

to  to 

In  what  follows  we  shall  denote  the  matrix  of  order  n  in  which  all  the 
elements  are  1  by  7 : 

We  consider  the  function  g{t)  defined  on  p.  126.    Then  we  have 
modP(0  ^9(t)I. 
But  Q\^{g{t)I)  is  the  normalized  solution  of  the  equation 


23 

•J 


Therefore,  by  4. 

fil(?W/)  =  e*«^=^+(MO  +  '^  +  ^  +  -)/,  (42) 

where 

t 

^(t)=!g(r)dr,     g(t)  =ma>x    \PiAt)\- 

Therefore  it  follows  from  5.  and  (42)  that: 

6.  mod^^(P)^^  +  -(e«'^(0-l)/, 

mod  [Ql  {P)-E]^-  (e«^<'>  - 1)  / , 

t 

mod[Ql(P)-E-fPdr]^-(e^(^'^^l-nh(t))I,      etc. 

ti 

We  shall  now  derive  an  important  formula  giving  an  estimate  for  the 
modulus  of  the  difference  between  two   matricants  : 


t 
By  replacing  the  independent  variable  t  by  h=  j g(J,)dt. 
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7.    mod  [Ql  (P)  -  Qi  {Q)]  ^  ^  e««<'-'o)  {e^i^-^o)  _  i)  /       (^  ^  g  , 

if 

modQ^ql,    mod  (P —  Q)  ^  d-I,    /=||1|| 

(q,d  are  non-negative  numbers ;  n  is  the  order  of  P  and  Q). 
We  denote  the  difference  P  —  Q  by  D.    Then 

P  =  Q+D,        mod  D^d-I. 
Using  the  expansion  (40)  of  the  matricant  in  a  series,  we  find: 
Ql{Q  +  D)-nUQ) 

t  t  r  t  r  t  X 

=^JD(T)dr-\-JD(r)JQ(o)dadT  +  JQ(r)JD(a)dadx+JD(T)JD(o)dadT  +  *" 

to  to  to  to  to  to  to 

From  this  expression  it  is  clear  that,  for  t  ^  to, 

mod  [Uo  (Q  +  D)-  Q\  (Q)\  ^  QI,  (mod  Q  +  mod  D)  -  Q^  (mod  Q ) 

^  ^\o  ((3  +  d)I)-  Q\  (ql)  =  c(*+^)  ^('-*«^  -  e?^<'-'«> 
=  e^ht-to)  (edi(t-to)  _^) 


E 


+  ^  (e««('-'o))  _  1)  /l  (end(«-to)  _  1)  / 
/  4. 1.  (e«?('-^o)  >- 1)  /2 1  (e'^it-to)  _  1) 


—  _L  gng{t-to)  (gnd(<-fo)  —  1)  /  . 

We  shall  now  show  how  to  express  by  means  of  the  matricant  the  general 
solution  of  a  system  of  linear  differential  equations  with  right-hand  sides : 

ii=i:PiAt)^k  +  fiW         (^  =  1,2,..., 71);  (43) 

Pik{t)  and  fi{t)   {i,  k  =  l,  2,  .  .  . ,  n)  are  continuous  functions  of  /  in  some 
interval. 

By  introducing  the  column  matrices  (* vectors')  x=^  (xi,  X2,  .  .  . ,  Xn)  and 
f={fi,f2,--',fn)  and  the  square  matrix  P=  11  pik  lli ,  we  write  the  system 
as  follows : 

~  =  P{t)x  +  f(t).  (430 
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We  shall  look  for  a  solution  of  this  equation  in  the  form 

x=DUP)^,  (44) 

where  z  is  an  unknown  column  depending  on  t.    We  substitute  this  expres- 
sion for  X  in  (43')  and  obtain : 


hence 


^=W>.(P)-]-'f(.t) 


Integrating  this,  we  find : 

t 
z=j[Ql(P)y-^f(r)dr-\-c. 

to 

where  c  is  an  arbitrary  constant  vector.     Substituting  this  expression  in 
(44),  we  obtain: 

x=  Q\ (P)  /  VQl {P)]-^f  (r)  dr  +  U, (P)  c.  (45) 

to 

When  we  give  to  t  the  value  to,  we  find :  x{to)  =  c.    Therefore  (45)  assumes 

the  form 

t 

X  =  Q\,  (P)  X  (g  +  JK  (t,  t)  /  (t)  dT ,  (45') 

to 

where 

K(t,r)=QUP)[Ql(P)li-' 

is  the  so-called  Cauchy  matrix. 
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of  Volterra 

1.  Let  us  consider  the  matricant  Q\{F).  We  divide  the  basic  interval 
{to,t)  into  n  parts  by  introducing  intermediate  points  ti,  t2,  .  .  . ,  tn-i  and 
set  Atjc  =  t]c  —  tjc-i  {k  =  l,  2,  .  .  . ,  n;  tn^t).  Then  by  property  1.  of  the 
matricant  (see  the  preceding  section), 

Q',=Ol_^---0^fil  (46) 
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In  the  interval  {tk-i,  ik)  we  choose  an  intermediate  point  Tu{k  =  l,2, . .  .  ,n). 
By  regarding  the  Atk  as  small  quantities  of  the  first  order  we  can  take,  for 
the  computation  of  Qfu_.  to  within  small  quantities  of  the  second  order, 
P(0  «  const.  =  P(rfc).    Then 

i3;*_^=c^(^*)^**+  (**)  =-£?+  P(T;fc)  Atj,^-(**)\  (47) 

here  we  denote  by  the  symbol  (**)  the  sum  of  terms  beginning  with  terms 
of  the  second  order. 

From  (46)  and  (47)  we  find: 

Q\^  =eP{yn)Mn  . . .  eP(T,)/f«,eP(r,)  At,  ^  (^)  (48) 

and 

Q\,  =  [E  +  P  (r J  AQ  '"[E  +  P  (r^)  At^]  [E  +  P  {r^)  At^]  +  (*) .      (49) 

When  we  pass  to  the  limit  by  increasing  the  number  of  intervals  indefi- 
nitely and  letting  the  length  of  these  intervals  tend  to  zero  (the  small  terms 
(*)  disappear  in  the  limit ),2*  we  obtain  the  exact  limit  formulas 


Ql(P)  =  lim  [e^(^n)^<»  . . .  eP(r2)^<2e^('^i>^'^]  (48') 

and 

Ql (P)=hm[E+P  (rj  Jg  •  •  •  [^  +  P  (Tg)  At^]  [E  +  P  (Tj)  At^].    (49') 

The  expression  under  the  limit  sign  on  the  right-hand  side  of  the  latter 
equation  is  the  product  integral}^  We  shall  call  its  limit  the  multiplicative 
integral  and  denote  it  by  the  symbol 

TaE  +  P  (t)  dt]  =  ]mi[E+P  (tJ  AtJ"-[E  +  P  (tJ  At  J .  (50) 

The  formula  (49')  gives  a  representation  of  the  matricant  in  the  form  of  a 
multiplicative  integral 

^UP)  =  JliE+Pdt),  (51) 

and  the  formulas  (48)  and  (49)  may  be  used  for  the  approximative  compu- 
tation of  the  matricant. 


24  These  arguments  can  be  made  more  precise  by  an  estimate  of  the  terms  we  have 
denoted  by  (*),  For  a  rigorous  deduction  of  (48')  we  have  to  use  formula  7.  of  §  5  in 
which  the  matricant  Q{t)  must  be  replaced  by  a  piece- wise  constant  matrix 

Q(t)  =  P(tH)  (U-i^t^U;  Jc  =  l,2,...,n). 

25  An  analogue  to  the  sum  integral  for  the  ordinary  integral. 
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The  multiplicative  integral  was  first  introduced  by  Volterra  in  1887. 
On  the  basis  of  this  concept  Volterra  developed  an  original  infinitesimal 
calculus  for  matrix  functions  (see  [63]  ).^^ 

The  whole  peculiarity  of  the  multiplicative  integral  is  tied  up  with  the 
fact  that  the  various  values  of  the  matrix  function  P{t)  in  subintervals  are 
not  permutable.    In  the  very  special  case  when  all  these  values  are  permutable 

p  in  p  {n = p  in  p  in     {t\  r  e  {t^,  o), 

the  multiplicative  integral,  as  is  clear  from  (48')  and  (51),  reduces  to  the 
matrix 

t 
f  p(t)dt 

2.    We  now  introduce  the  multiplicative  derivative 

D,X  =  ^X-\  (62) 

The  operations  Dt  and   ^  are  mutually  inverse : 

If  ^" 

D,X=P. 

then" 

X=ll(E+Pdt)-C        (C  =  X(g), 

and  vice  versa.    The  last  formula  can  also  be  written  as  follows :  ^® 

JI^{E  +  Pdt)=X{t)Xit,)-K  (53) 

We  leave  it  to  the  reader  to  verify  the  following  differential  and  integral 
formulas  :^® 


26  The  multiplicative  integral  (in  German,  ProduTct-Integral)  was  used  by  Schlesinger 
in  investigating  systems  of  linear  differential  equations  with  analytic  coefficients  [49] 
and  [50];  see  also  [321]. 

The  multiplicative  integral  (50)  exists  not  only  for  a  function  P(t)  that  is  continuous 
in  the  interval  of  integration,  but  also  under  considerably  more  general  conditions 
(see  [116]). 

2*^  Here  the  arbitrary  constant  matrix  C  is  an  analogue  to  the  arbitrary  additive  con- 
stant in  the  ordinary  indefinite  integral. 

28  An  analogue  to  the  formula    f  Pdt  =  X(t)  —  Z(to),  where  -^  z=P. 

to 

29  These  formulas  can  be  deduced  immediately  from  the  definitions  of  the  multiplica- 
tive derivative  and  multiplicative  integral  (see  [63]  ).  However,  the  integral  formulas  are 
obtained  more  quickly  and  simply  if  the  multiplicative  integral  is  regarded  as  a  matricant 
and  the  properties  of  the  matricant  that  were  expounded  in  the  preceding  section  are  used 
(see  [49]). 
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DIFFERENTIAL   FORMULAS 

I.  A  {XY)  =D,{X)  +  XD,  (D  X-\ 

II.  A  (Z"")  =  X^  {D^Y  X'^-K 
III.  D,{X-^)  =-X-'DAX)X  =  -(D,{X-')y, 

INTEGRAL  FORMULAS 

IV.]]^ {E  +  PdJr)  =7;^  {E  +  PdT)]l{E  +Pdr). 
V.  ][  (E  +  PdT)=  []';  (E  +  Pdr)]-"^. 

YL.][  (E  +  GPO-^  dr)  =  Gj[^{D  +  P<?t)  C-i  (C  is  a  constant  matrix) 
VII.  7;^  [E  +  (Q  +  DtX)  dr]  =  X  {tfjl  (E  +  X-'QX  dx)  X  (t^)-\  ^r 

Vm.  mod[l[  {E+P dr)  -][^(E  +  Q  dr)]  ^  ^ c««('-^o)  (e«^('-^o)  -1)7  {t>  g, 


*/ 


modQ^Q-Z,        mod{P  —  Q)^d-I,        /=  11  1 


( g'  and  d  are  non-negative  numbers ;  n  is  the  order  of  P  and  Q ) . 

Suppose  now  that  the  matrices  P  and  Q  depend  on  the  same  parameter  a 

P=P(T,a),       Q  =  Q{r,oc) 

and  that 

lim  P  (t,  a)  =  limQ  (r,  a)=  Pq  (t), 

where  the  limit  is  approached  uniformly  with  respect  to  r  in  the  interval 
ito,t)  in  question.  Furthermore,  let  us  assume  that  for  a^-  ao  the  matrix 
Q(r,a)  is  bounded  in  modulus  by  ql,  where  g  is  a  positive  constant.  Then, 
setting 

hmd{a)  =  0, 

we  have :  _ ,  .  i       ,       a  /      m 

d  (a)  =    max     |  Pi^  (t,  a)  —  g<i  (t,  a)  . 


31  The  formula  VII  can  be  regarded  in  a  certain  sense  as  a  analogue  to  the  formula  for 
integration  by  parts  in  ordinary  (non-multiplicative)  integrals.  VII  follows  from  2. 
of  §5). 
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Therefore  it  follows  from  formula  VIII  that : 

lim  [Jl  [E  +  P  dt)  -]l  {E  +  Q  dt)]  =0. 
In  particular,  if  Q  does  not  depend  on  a  {Q{t,  a)  =Po{t)),  v^^e  obtain: 

Jim  ][  [E+P  {t,  a)  dt]  =]l  [E  +  P^  (t)  dt], 
Pq  (t)  =  ]im  P  {t,  a) . 


where  "*  ° 


a-^otj 


§  7.    Differential  Systems  in  a  Complex  Domain.     General  Properties 

1.    We  consider  a  system  of  differential  equations 

§  =  i'K*W^*-  (54) 

Here  the  given  function  pudz)  and  the  unknown  functions  Xi{z)  (i,k  = 
1,  2,  .  .  . ,  n)  are  supposed  to  be  single-valued  analytic  functions  of  a  complex 
argument  z,  regular  in  a  domain  G  of  the  complex  2:-plane. 

Introducing  the  square  matrix  P(2;)  —  \\  Pik(z)  11 1  and  the  column  matrix 
x^  (xi,  X2,  .  . . ,  Xn),  we  can  write  the  system  (54),  as  in  the  case  of  a  real 
argument  ( §  1 ) ,  in  the  form 

|  =  P(.):r  (54') 

Denoting  an  integral  matrix,  i.e.,  a  matrix  whose  columns  are  n  linearly 
independent  solutions  of  (54),  by  X,  we  can  write  instead  of  (54')  : 

'§  =  P(z)X  (55) 

Jacobi  's  formula  holds  also  for  a  complex  argument  z : 

t 

f  tr  Pdz 

|X|  =  ce'o  (56) 

z 

Here  it  is  assumed  that  Zo  and  all  the  points  of  the  path  along  which  j  is 
taken  are  regular  points  for  the  single- valued  analytic  function  ivP{z)  = 


32  Here,  and  in  what  follows,  the  path  of  integration  is  taken  as  a  sectionally  smooth 
curve. 
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2.  A  peculiar  feature  of  the  ease  of  a  complex  argument  is  the  fact  that  for 
a  single- valued  function  P{z)  the  integral  matrix  X{z)  may  well  be  a  many- 
valued  function  of  z. 

As  an  example,  we  consider  the  Cauchy  system 

J  V  J  J 

-—  = X        ( C7  is  a  constant  matrix) .  (57) 

One  of  the  solutions  of  this  system,  as  in  the  case  of  a  real  argument  (see 
p.  115),  is  the  integral  matrix 

X=e^in(.-«)^(2;_a)^.  (58) 

For  the  domain  G  we  take  the  whole  2;-plane  except  the  point  z  =  a.  All  the 
points  of  this  domain  are  regular  points  of  the  coefficient  matrix 

If  U  y^O,  then  z  =  a  is  a,  singular  point  (a  pole  of  the  first  order)  of  the 
matrix  function  P(z)  =TJ/(z  —  a). 

An  element  of  the  integral  matrix  (58)  after  going  around  the  point 
z  =  a  once  in  the  positive  direction  returns  with  a  new  value  which  is  obtained 
from  the  old  one  by  multiplication  on  the  right  by  the  constant  matrix 

V  =  e2m?7 

In  the  general  case  of  a  system  (55)  we  see,  by  the  same  reasoning  as  in 
the  case  of  a  real  argument,  that  two  single-valued  solutions  X  and  X  are 
always  connected  in  some  part  of  the  domain  G  by  the  formula 

x=xc, 

where  C  is  a  constant  matrix.  This  formula  remains  valid  under  any 
analytic  continuation  of  the  functions  X{z)  and  X{z)  in  G. 

The  proof  of  the  theorem  on  the  existence  and  (for  given  initial  values) 
uniqueness  of  the  solution  of  (54)  is  similar  to  that  of  the  real  case. 

Let  us  consider  a  simply-connected  star  domain  Gi  (relative  to  ^o)'^^ 
forming  part  of  G  and  let  the  matrix  function  P(z)  be  regular^^  in  G^i.  We 
form  the  series 

z  z  z 

^+/p(C)(^-f/p(C)/p(r)C'^+---.  (59) 


33  A  domain  is  called  a  star  domain  relative  to  a  point  Zo  if  every  segment  joining  Zo 
to  an  arbitrary  point  z  of  the  domain  lies  entirely  in  the  given  domain. 

34  I.e.,  all  the  elements  pik(z)  (i,'k  =  l,  2,  .  .  . ,  n)  of  P(z)  are  regular  functions  in  Gu 
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Since  Gi  is  simply-connected,  it  follows  that  every  integral  that  occurs  in 
(59)  is  independent  of  the  path  of  integration  and  is  a  regular  function  in 
6^1.  Since  Gi  is  a  star  domain  relative  to  Zo,  we  may  assume  for  the  purpose 
of  an  estimate  of  the  moduli  of  these  integrals  that  they  are  all  taken  along 
the  straight-line  segment  joining  Zo  and  z. 

That  the  series  (59)  converges  absolutely  and  uniformly  in  every  closed 
part  of  Gi  containing  Zo  follows  from  the  convergence  of  the  majorant 


l  +  lM-{-^im^  +  ^PM^  +  '' 


Here  M  is  an  upper  bound  for  the  modulus  of  P(z)  and  I  an  upper  bound 
for  the  distance  of  z  from  Zo,  and  both  bounds  refer  to  the  closed  part  of 
6ri  in  question. 

By  differentiating  term  by  term  we  verify  that  the  sum  of  the  series 
(59)  is  a  solution  of  (55).  This  solution  is  normalized,  because  for  z  =  Zo 
it  reduces  to  the  unit  matrix  E.  The  single-valued  normalized  solution  of 
(55)  will  be  called,  as  in  the  real  case,  a  matricant  and  will  be  denoted  by 
Qto{P).  Thus  we  have  obtained  a  representation  of  the  matricant  in  Gi  in 
the  form  of  a  series^ ^ 

Q'JP)  =  E+jp[^)d^+fP{OlP{ndi'dC  +  --:  (60) 

to  «o  'o 

The  properties  1.-4.  of  the  matricant  that  were  set  up  in  ^5  automatically 
carry  over  to  the  case  of  a  complex  argument. 

Any  solution  of  (55)  that  is  regular  in  G  and  reduces  to  the  matrix  Xo 
for  z  =  Zo  can  be  represented  in  the  form 

X  =  Diy)'C         (C  =  Xo).  (61) 

The  formula  (61)  comprises  all  single-valued  solutions  that  are  regular 
in  a  neighborhood  of  Zo  (2:0  is  a  regular  point  of  the  coefficient  matrix  P{z)). 
These  solutions  when  continued  analytically  in  G  give  all  the  solutions  of 
(55)  ;  i.e.,  the  equation  (55)  cannot  have  any  solutions  for  which  Zo  w^ould 
be  a  singular  point. 

For  the  analytic  continuation  of  the  matricant  in  G  it  is  convenient  to 
use  the  multiplicative  integral. 


35  Our  proof  for  the  existence  of  a  normalized  solution  and  its  representation  in  Gi 
by  the  series  (60)  remains  valid  if  instead  of  the  assumption  that  the  domain  is  a  star 
dpmain  we  make  a  wider  assumption,  namely,  that  for  every  closed  part  of  d  there  exists 
a  positive  number  I  such  that  every  point  s  of  this  closed  part  can  be  joined  to  Zo  by  a  path 
of  length  not  exceeding  I. 
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§  8.    The  Multiplicative  Integral  in  a  Complex  Domain 

1.  The  multiplicative  integral  along  a  curve  in  the  complex  plane  is  defined 
in  the  following  way. 

Suppose  that  L  is  some  path  and  P{z)  a^  matrix  function,  continuous  on 
L.  We  divide  the  path  L  into  n  parts  {zq,Zx),  {zy,Z2),  . . . ,  {Zn-i,Zn)  ;  here 
Zq  is  the  beginning,  and  Zn^z  the  end  of  the  path,  and  z^,  2:2,  ... ,  Zn-i  are 
intermediate  points  of  division.  On  the  segment  z^-iZu  we  take  an  arbitrary 
point  Cfc  and  we  use  the  notation  Azk=^Z]c  —  Z]c-i  (A:  =  l,  2,  .  .  . ,  n).  We 
then  define 

When  we  compare  this  definition  with  that  on  p.  132,  we  see  that  they 
coincide  in  the  special  case  where  L  is  a  segment  of  the  real  axis.  However, 
even  in  the  general  case,  where  L  is  located  anywhere  in  the  complex  plane, 
the  new  definition  may  be  reduced  to  the  old  one  by  a  change  of  the  variable 
of  integration. 

If 

z=z{t) 

is  a  parametric  equation  of  the  path,  where  z{t)  is  a  continuous  function 

in  the  interval  {io,i)  with  a  piece-wise  continuous  derivative  ^^l ,  then  it  is 
easy  to  see  that 

/[£  +  P  (z)  dz]  =f^  [e+piz  m  ^  * j . 

This  formula  shows  that  the  multiplicative  integral  along  an  arbitrary 
path  exists  if  the  matrix  Piz)  under  the  integral  sign  is  continuous  along 
this  path.^^ 

2.  The  multiplicative  derivative  is  defined  by  the  previous  formula 

Here  it  is  assumed  that  X(z)  is  an  analytic  function. 

All  the  differential  formulas  (I-III)  of  the  preceding  section  carry  over 
without  change  to  the  case  of  a  complex  argument.  As  regards  the  integral 
formulas  IV-VI,  their  outward  form  has  to  be  modified  somewhat: 


36  See  footnote  26.     Even  when  P(z)  is  continuous  along  L,  the  function  P[z(t)]-^ 

may  only  be  sectionally  continuous.     In  this  case  we  can  split  the  interval   (to,t)  into 

partial  intervals  in  each  of  which  the  derivative  -^-  is  continuous  and  can  interpret  the 

at 

integral  from  U  to  t  as  the  sum  of  the  integrals  along  these  partial  intervals. 


§  8.  Multiplicative  Integral  in  Complex  Domain  139 


IV'.      f    {E  +  Pdz)=J{E  +  Pdz)JiE  4-  Pdz). 

—L  L 

VI'.  J{E  +  CPC-^dz)=C f{E+  Pdz)G-^Cis  a  constant  matrix). 


In  IV'  we  have  denoted  by  U  +  X"  the  composite  path  that  is  obtained 
by  traversing  first  L'  and  then  L".  In  V,  —  L  denotes  the  path  that  differs 
from  L  only  in  direction. 

The  formula  VII  now  assumes  the  form 

VII'.  J[E  +  (Q  +  D,X)  dz]  -=X  (z)  J{E  +  X-'QX  dz)  X  {zo)-\ 

L  L 

Here  X{zq)  and  X{z)  on  the  right-hand  side  denote  the  values  of  X(2;)  at 
the  beginning  and  at  the  end  of  L,  respectively. 
Formula  VIII  is  now  replaced  by  the  formula 

Vlll'.mod  [J  {E  ^-  P  dz)-]  {E  +  Q  dz)]^\;e'^^  {e"^'^  —  \)I, 

L  L 

where  mod  Q  ^  ql,  mod  (P  —  Q)  ^  d*/,  7=  ||  1  ||,  and  I  is  the  length  of  L. 

VIII'  is  easily  obtained  from  VIII  if  we  make  a  change  of  variable  in  the 

latter  and  take  as  the  new  variable  of  integration  the  arc-length  s  along  L 


(with 


dz  I 

d8\  ^^• 


3.  As  in  the  case  of  a  real  argument,  there  exists  a  close  connection  between 
the  multiplicative  integral  and  the  matricant. 

Suppose  that  P{z)  is  a  single-valued  analytic  matrix  function,  regular 
in  (t,  and  that  Go  is  a  simply-connected  domain  containing  Zq  and  forming 
part  of  G.    Then  the  matricant  Ql^{P)  is  a  regular  function  of  z  in  G^o- 

We  join  the  points  Zo  and  z  by  an  arbitrary  path  L  lying  entirely  in  Go 
and  we  choose  on  L  intermediate  points  Z\,  Z2,  . .  . ,  Zn-i-  Then,  using  the 
equation 

and  proceeding  to  the  limit  exactly  as  in  §  6  (p.  132),  we  obtain : 
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Ql  (P)  =J{E+P  dz)  =11  {E  +  P  dz) .  (62) 

L 

From  this  formula  it  is  clear  that  the  multiplicative  integral  depends  not 
on  the  form  of  the  path,  but  only  on  the  initial  point  and  the  end  point  if 
the  whole  path  of  integration  lies  in  the  simply-connected  domain  Go  within 
which  the  integrand  P{z)  is  regular.  In  particular,  for  a  closed  contour  L 
in  Go,  we  have  : 

S){E^Pdz)=E.  (63) 

This  formula  is  an  analogue  to  Cauchy's  well-known  theorem  according 
to  which  the  ordinary  (non -multiplicative)  integral  along  a  closed  contour 
is  zero  if  the  contour  lies  in  a  simply-connected  domain  within  which  the 
integrand  is  regular. 

4.  The  representation  of  the  matricant  in  the  form  of  the  multiplicative 
integral  (62)  can  be  used  for  the  analytic  continuation  of  the  matricant 
along  an  arbitrary  path  L  in  G.    In  this  case  the  formula 

X  =Jl{E  +  P  dz)  X^  (64) 

gives  all  those  branches  of  the  many-valued  integral  matrix  X  of  the  differ- 

dX 
ential  equation  -j^  =  PX  that  for  z^Zo  reduce  to  Xo  on  one  of  the  branches. 

The  various  branches  are  obtained  by  taking  account  of  the  various  paths 
joining  Zo  and  z. 

By  Jacobi's  formula  (56) 

/  ix  Pdz 


and,  in  particular,  for  Xo  =  E, 


I  tr  Pdz 

{E  +  Pdz)\=e'o  (65) 


From  this  formula  it  follows  that  the  multiplicative  integral  is  always 
a  non-singular  matrix  provided  only  that  the  path  of  integration  lies  entirely 
in  a  domain  in  which  P(z)  is  regular. 

If  L  is  an  arbitrary  closed  path  in  G  and  G  is  not  a  simply-connected 
domain,  then  (63)  cannot  hold.    Moreover,  the  value  of  the  integral 
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{E  +  Pdz) 

is  not  determined  by  specification  of  the  integrand  and  the  closed  path  of 
integration  L  but  also  depends  on  the  choice  of  the  initial  point  of  integra- 
tion Zo  on  L.  For  let  us  take  on  the  closed  curve  L  two  points  Zo  and  Zi  and 
let  us  denote  the  portions  of  the  path  from  Zo  to  Zi  and  from  Zi  to  Zo  (in  the 
direction  of  integration)  by  Li  and  L2,  respectively.  Then,  by  the  for- 
mula TV," 

H-l  hi] 

Zq         L2      Li  Zi  Li      Xj 

and  therefore 


=/.f /"\  (66) 


Zi  Li       Zq      Li 

The  formula  (66)  shows  that  the  symbol  ^  (E  +  Pdz)  determines  a  cer- 
tain matrix  to  within  a  similarity  transformation,  i.e.,  determines  only  the 
elementary  divisors  of  that  matrix. 

We  consider  an  element  X{z)  of  the  solution  (64)  in  a  neighborhood  of 
Zo.  Let  L  be  an  arbitrary  closed  path  in  G  beginning  and  ending  at  Zq.  After 
analytic  continuation  along  L  the  element  X{z)  goes  over  into  an  element 
X{z).  But  the  new  element  X{z)  satisfies  the  same  differential  equation 
(55),  since  P{z)  is  a  single-valued  function  in  G.    Therefore 

x=xv, 

where  F  is  a  non-singular  constant  matrix.    From  (64)  it  follows  that 


X{zQ)  =  &){E  +  Pdz)XQ. 
Comparing  this  equation  with  the  preceding  one,  we  find: 

V  =  X-^j){E  +  Pdz)X^.  (67) 

In  particular,  for  the  matricant  X  =Ql^,  we  have  Xo  =  E,  and  then 

V  =  I{E  +  Pdz).  (68) 

•?0 


s''  To  simplify  the  notation  we  have  omitted  the  expression  to  be  integrated,  E  +  Pdz, 
which  is  the  same  for  all  the  integrals. 
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§  9.     Isolated  Singular  Points 

1.  We  shall  now  deal  with  the  behavior  of  a  solution  (an  integral  matrix) 
in  a  neighborhood  of  an  isolated  singular  point  a. 

Let  the  matrix  function  P{z)  be  regular  for  the  values  of  z  satisfying 
the  inequality 

0  <\z  —  a\<B. 

The  set  of  these  values  forms  a  doubly-connected  domain  G.    The  matrix 
function  P{z)  has  in  G  an  expansion  in  a  Laurent  series 

P{z)=   f  Pn{z-aT.  (69) 

7l=  — oo 

An  element  X{z)  of  the  integral  matrix,  after  going  once  around  a  in 
the  positive  direction  along  a  path  L,  goes  over  into  an  element 

X-^{z)=X(z)V  , 

where  "T  is  a  constant  non-singular  matrix. 

Let  Z7  be  the  constant  matrix  that  is  connected  with  T  by  the  relation 

y^^2nW^  (70) 

Then  the  matrix  function  {z  —  a)^  after  going  around  a  along  L  goes 
over  into  {z  —  a)^V.    Therefore  the  matrix  function 

F{z)  =  X{z){z-a)-^  ,  (71) 

which  is  analytic  in  G,  goes  over  into  itself  (remains  unchanged)  by  analytic 
continuation  along  L}^  Therefore  the  matrix  function  F{z)  is  regular  in  G 
and  can  be  expanded  in  6^  in  a  Laurent  series 

F(z)=  f  F,(z-aT.  (72) 

n  =  — oo 

From  (71)  it  follows  that: 

X{z)=F[z){z  —  aY .  (73) 

Thus  every  integral  matrix  X{z)  can  be  represented  in  the  form  (73), 
where  the  single-valued  function  F{z)  and  the  constant  matrix  TJ  depend  on 


38  Hence  it  follows  that  when  z  traverses  any  other  closed  path  in  G,  the  function  F{z) 
returns  to  its  original  value. 


§  9.  Isolated  Singular  Points  143 

the  coefficient  matrix  P(z).  However,  the  algorithmic  determination  of  U 
and  of  the  coefficients  Fn  in  (72)  from  the  coefficients  P„  in  (69)  is,  in 
general,  a  complicated  task. 

A  special  case  of  the  problem,  where 

oo 

n=-\ 

will  be  analyzed  completely  in  §  10.     In  this  case,  the  point  a  is  called  a 
regular  singularity  of  the  system  (55). 
If  the  expansion  (69)  has  the  form 

oo 

P{z)=  2J  PA^-aT       {q>l;    P^,¥=0) 

n=  —  q 

then  a  is  called  an  irregular  singularity  of  the  type  of  a  pole.  Finally,  if 
there  is  an  infinity  of  non-zero  matrix  coefficients  P„  with  negative  powers 
oi  z  —  a  in  (69),  then  a  is  called  an  essential  singularity  of  the  given  differ- 
ential system. 

From  (73)  it  follows  that  under  an  arbitrary  single  circuit  in  the  posi- 
tive direction  (along  some  closed  path  L)  an  integral  matrix  X{z)  is  multi- 
plied on  the  right  by  one  and  the  same  matrix 

If  this  circuit  begins  (and  ends)  at  Zq,  then  by  (67) 

F=  X(zo)-^  Z{E^  Pdz)  X(zo) .  (74) 


2o 

If  instead  oi  X{z)  we  consider  any  other  integral  matrix  X{z)  =X{z)C 
(C  is  a  constant  matrix ;  |  0  |  y^  0),  then,  as  is  clear  from  (74),  Y  is  replaced 
by  the  similar  matrix 

Thus,  the  'integral  substitutions'  V  of  the  given  system  form  a  class  of 
similar  matrices. 

From  (74)  it  also  follows  that  the  integral 

(E  +  Pdz)  (75) 

is  determined  by  the  initial  point  Zo  and  does  not  depend  on  the  form  of  the 
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curved  path.^^     If  we  change  the  point  Zo,  then  the  various  values  of  the 
integral  that  are  so  obtained  are  similar.^^ 

These  properties  of  the  integral  (75)  can 
also  be  confirmed  directly.  For  let  L  and  V 
be  two  closed  paths  in  G  around  z  =  a  with  the 
initial  points  Zo  and  zj  (see  Fig.  6). 

The  doubly-connected  domain  between  L 
and  V  can  be  made  simply-connected  by  intro- 
ducing the  cut  from  Zq  to  Zo  .  The  integral 
along  the  cut  will  be  denoted  by 


T=ll{Ei-Pdz). 


Fig.  6 


Since  the  multiplicative  integral  along  a  closed  contour  of  a  simply- 
connected  domain  is  E,  we  have 


hence 


U       L 

J=tJt-k 


Thus,  the  integral  ^{E  +  Pdz),  like  V,  is  determined  to  within  similarity, 
and  we  shall  occasionally  write  (74)  in  the  form 


{E  +  Pdz); 

meaning  that  the  elementary  divisors  of  the  matrices  on  the  left-hand  and 
right-hand  sides  of  the  equation  coincide. 

2.    As  an  example,  we  consider  a  system  with  a  regular  singularity 


where 


Let 


f  =  P(.)X 


n=0 
^-1 


•■^9  Under  the  condition,  of  course,  that  the  path  of  integration  goes  around  a  once  in 
the  positive  direction. 

40  This  follows  from  (74),  or  from  (66). 
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Using  the  formula  VIII'  of  the  preceding  section,  we  estimate  the  modulus 
of  the  difference 

D=S){E  +  Pdz)  —  j){E  +  Qdz),  (76) 

taking  as  path  of  integration  a  circle  of  radius  r  (r  <_  R)  in  the  positive 
direction.    Then  with 

modP_i^p_J,        mod   2J  Pni^-aT  ^d{r)  I ,     I=\\l\\; 

\z—a\  =r  n=0 

we  set  in  VIII' : 

q=^,     d=d{r),     l=2nr 
and  then  obtain 

modD  <.  —  e^'^P-i  U^^nrd^r)  _  1)  / . 

Hence  it  is  clear  that*^ 

limD-O.  (77) 

On  the  other  hand,  the  system 

is  a  Cauchy  system,  and  in  that  case  we  have  for  an  arbitrary  choice  of  the 
initial  point  Zq  and  for  every  r  <^E 


^{E-[-Qdz)  =  e^"^^-^. 


Therefore  it  follows  from  (76)  and  (77)  that: 


]mi(h{E  +  Pdz)=^  e2^^"^-i .  (78) 


r-*-0  -J 

^0 


But  the  elementary  divisors  of  the  integral  (p  (E  +  Pdz)  do  not  depend  on 

00  and  r  and  coincide  with  those  of  the  integral  substitution  Y. 

From  this  Volterra  in  his  well-known  memoir  (see  [374])  and  his  book 
[63]  (pp.  117-120)  deduces  that  the  matrices  Y  and  e^^'^-'are  similar,  so 
that  the  integral  substitution  Y  is  determined  to  within  similarity  by  the 
'residue'  matrix  P_i. 

But  this  assertion  of  Yolterra  is  incorrect. 


*i  Here  we  have  used  the  fact  that  for  a  suitable  choice  of  d(r) 

liind(r)  =r  do, 
r-fO 

where  do  is  the  greatest  of  the  moduli  of  the  elements  of  Po. 
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From  (74)  and  (78)  we  can  only  deduce  that  the  characteristic  values 
of  the  integral  substitution  V  coincide  with  those  of  the  matrix  e^''^^~^.  How- 
ever, the  elementary  divisors  of  these  matrices  may  be  distinct.  For  example, 
for  every  r  ^  0  the  matrix 

a     r 

0     a 

has  one  elementary  divisor  (A  —  a)^,  but  the  limit  of  the  matrix  for  r-^  0, 
i.e.,  the  matrix  ||  J  2  ||  >  ^^-s  two  elementary  divisors  X  —  a,  I  —  a. 

Thus,  Yolterra's  assertion  does  not  follow  from  (74)  and  (78).  It  is  not 
even  true  in  general,  as  the  following  example  shows. 

Let 


P{z)  = 


The  corresponding  system  of  differential  equations  has  the  form 

dz  2'  ^2  z  ' 

Integrating  the  system  we  find : 

Xi  =  c]nz  +  d,         X2=  — . 


0 

0 

i+ 

0     1 

0 

—  1 

z 

0     0 

The  integral  matrix 


Xiz) 


]nz     1 

z-i      0 


when  the  singular  point  2;  =  0  is  encircled  once  in  the  positive  direction,  is 
multiplied  on  the  right  by  the  matrix 


V  = 


1       0 

27ti     1 


This  matrix  has  one  elementary  divisor  (A  —  1)^.     At  the  same  time  the 
matrix 


f2niP^i  =^  p2ni  \\  0    — I  ||  r=: 


1     0 
0     1 


=  E 


has  two  elementary  divisors  X  —  1,  X  —  1. 

3.  We  now  consider  the  case  where  the  matrix  P{z)  has  a  finite  number 
of  negative  powers  of  z  —  a  (a  is  a  regular  or  irregular  singularity  of  the 
type  of  a  pole)  : 
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n  =  0 


We  transform  the  given  system 

dX 


by  setting 


dz 

X=-A{z)Y 


(79) 
(80) 


where  A{z)  is  a  matrix  function  that  is  regular  at  2:  =  0  and  assumes  there 
the  value  E : 

A{z)  =  E  +  A^{z  —  a)  +  A<^{z-a)^^"' ; 

the  power  series  on  the  right-hand  side  converges  for  \z  —  a\  <  ri. 

The  well-known  American  mathematician  G.  D.  Birkhoff  has  published 
a  theorem  in  1913  (see  [117])  according  to  which  the  transformation  (80) 
can  always  be  chosen  such  that  the  coefficient  matrix  of  the  transformed 
system 

dY 


dz 


=  P*{z)Y 


(79') 


contains  only  negative  powers  oi  z  —  a : 


P*{z) 


{z-a)9 


+ 


P-l 


Birkhoff 's  theorem  with  its  complete  proof  is  reproduced  in  the  book 
Ordinary  Differential  Equations,  by  E.  L.  Ince.*^  Moreover,  on  the  basis 
of  these  'canonical'  systems  (79')  he  investigates  the  behavior  of  the  solution 
of  an  arbitrary  system  in  the  neighborhood  of  a  singular  point. 

Nevertheless,  Birkhoff  ^s  proof  contains  an  error,  and  the  theorem  is  not 
true.  As  a  counter-example  we  can  take  the  same  example  by  which  we 
have  above  refuted  Volterra's  claim.*^ 

In  this  example  q  =  l,  a  =  0  and 


,P^=0      for  71  =  1,2, 


0 

0 

0     1 

1 

.     ^0- 

-1  ~ 

0 

-1 

0     0 

"^2  See  [20],  pp.  632-41.  Birkhoff  and  Ince  formulate  the  theorem  for  the  singular 
point  2=  00 .  This  is  no  restriction,  because  every  singular  point  2=- a  can  be  carried  by 
the  transformation  2'  z=l/  (2  —  a)  into  z'  =  cc. 

43  In  the  case  q  =  l  the  erroneous  statement  of  Birkhoff  coincides  in  essence  with 
Volterra's  mistake  (see  p.  145). 
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Applying  Birkhoff's  theorem  and  substituting  in    (79)    the  product 

dY 
dz 


dY  P* 

AY  for  X  in  (79),  we  obtain  after  replacing  -^^  by  —=^    and  cancelling  Y : 


z  dz 

Equating  the  coefficients  of  I/0  and  of  the  free  terms  we  find : 

Pli=P-i,     A,P_,-P_^A^  +  A,=P,. 
Setting 


we  obtain 


This  is  a  contradictory  equation. 

In  the  following  section  we  shall  examine,  for  the  case  of  a  regular  singu- 
larity, what  canonical  form  the  system  (79)  can  be  transformed  into  by 
means  of  a  transformation  (80). 


Ai  = 

a     h 
c     d 

? 

a     0 

0         0| 

0     1 

c     0 

~ 

—  c 

—  d 

^^ 

0     0 

§  10.    Regular  Singularities 

In  studying  the  behavior  of  a  solution  in  a  neighborhood  of  a  singular  point 
we  can  assume  without  loss  of  generality  that  the  singular  point  is  ^  =  0.*^ 

1.    Let  the  given  system  be 


where 


'i=pi^)x, 


p{^)=^+2:p,. 


771  =  0 

00 

and  the  series  2*  P^  z"*  converges  in  the  circle  \z\  <  r. 

Tn=0 

We  set 


where 


X  =  A{z)Y, 
A{z)=E  +  A^z  +  A^z^  + 


(81) 
(82) 


(83) 
(84) 


**  By  the  transformation  z'  ■=^z  —  a  or  0'  =  \l z  every  finite  point  0  1=  a  or  2  =  00  can 
be  carried  into  z'  =  0. 
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Leaving  aside  for  the  time  being  the  problem  of  convergence  of  the  series 
(84),  let  us  try  to  determine  the  matrix  coefficients  Am  such  that  the  trans- 
formed system 


where 


S=^-(^)^> 


m=0 


(85) 
(86) 


is  of  the  simplest  possible  ('canonical')  form.*^ 

When  we  substitute  the  product  AY  for  X  in  (81)  and  use  (85),  we 
obtain : 

A  {z)  P*{z)  Y+-gY  =  P{z)A{z)Y. 

Multiplying  both  sides  of  the  equation  by  Y~^  on  the  right  we  find: 

dA 


P{z)A{z)-A{z)P*{z) 


dz 


When  we  replace  here  P{z),  A{z),  and  P*(z)  by  the  series  (82),  (84), 
and  (86)  and  equate  the  coefficients  of  equal  powers  of  z  on  the  two  sides, 
we  obtain  an  infinite  system  of  matrix  equations  for  the  unknown  coefficients 
Ai,  A2,  ,  .  .  :*« 


1.  P-i  =  Pli, 

2.  P_,A,  -  A,  {P_,  +  E)  +  Po=  P;  , 

3.  P_^A^  -  A^  {P_,  +2E)  +  PoA^  -  A^P*  +  Pi  =  PI , 

{m  +  2).  P_iA^+i  -  A^+i  [P_i  +  (m  +  1)  ^]  + 

+  Pq-^wi        -^mPo  +  Pl-^m-X       -^m-lPl  +  1"  Pfn  ^^  Pjn  • 


(87) 


2.    We  consider  several  cases  separately : 

1.    The  matrix  P_i  does  not  have  distinct  characteristic  valv.es  that 
differ  from  each  other  by  an  integer. 


*5  We  shall  aim  at  having  only  a  finite  number  (and  indeed  the  smallest  possible 
number)  of  non-zero  coefficients  Pm  in  (86). 

*^  In  all  the  equations  beginning  with  the  second  we  replace  P-i  by  P  ^i  in  accordance 
with  the  first  equation. 
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In  this  case  the  matrices  P_i  and  P-i  +  kE  do  not  have  characteristic 
values  in  common  for  any  k  =  l,2,S,  . . . ,  and  therefore  (see  Vol.  I,  Chapter 
VIII,  §  3)*'  the  matrix  equation 

P_^U~U{P_^  +  kE)=T 

has  one  and  only  one  solution  for  an  arbitrary  right-hand  side  T. 
We  shall  denote  this  solution  by 

We  can  therefore  set  all  the  matrices  P^  {m  =  0,l,2,  .  .  .)  in  (87)  equal  to 
zero  and  determine  Ai,  A2,  . .  .  successively  by  means  of  the  equation 

^1  =  <^i  (^-1.  -  ^0).     ^2  =  ^2  (^-1,  -Pi- ^0^1),  •  •  •  • 
The  transformed  system  is  then  a  Cauchy  system 

dz—    z    ^' 
and  so  the  solution  X  of  the  original  system  ( 81 )  is  of  the  form*^ 

X  =  A{z)zP-K  (88) 

2.  Among  the  distinct  characteristic  values  of  P_i  there  are  some  whose 
difference  is  am,  integer;  furthermore,  the  matrix  P_i  is  of  simple  structure. 

We  denote  the  characteristic  values  of  P_i  by  Xi,  I2,  -  -  - ,  K  and  order 
them  in  such  a  way  that  the  inequalities 

Ee  (Ai)  ^  Re  {X^)  ^  . . .  ^  Re  {K)  (89) 

hold. 


*''  However,  we  can  also  prove  this  without  referring  to  Chapter  VIII.  The  proposi- 
tion in  which  we  are  interested  is  equivalent  to  the  statement  that  the  matrix  equation 

p.ii7  =  r7(p_i  +  fci;)  (*) 

has  only  the  solution  17=  0.    Since  the  matrices  P-i  and  P-i  +  ^^  have  no  characteristic 
values  in  common,  there  exists  a  polynomial  /(X)  for  which 

/(P_x)rz:zO,/(P-x  +  fcJ5;)=:^. 

But  from  (*)  it  follows  that 

/(P_x)Z7zz:Z7/(P-i  +  fcE). 

Hence  V=0. 

^s  The  formula  (88)  defines  one  integral  matrix  of  the  system  (81).  Every  integral 
matrix  is  obtained  from  (88)  by  multiplication  on  the  right  by  an  arbitrary  constant 
non-singular  matrix  C 
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Without  loss  of  generality  we  can  replace  P_i  by  a  similar  matrix.  This 
follows  from  the  fact  that  when  we  multiply  both  sides  of  (81)  on  the  left 
by  a  non-singular  matrix  T  and  on  the  right  by  r~\  we  in  fact  replace  all 
the  Pm  by  TPmT~^  {m  =  — 1,  0,  1,  2,  .  .  .);  moreover,  X  is  replaced  by 
TXT~^ .  Therefore  we  may  assume  in  this  case  that  P_i  is  a  diagonal 
matrix : 

We  introduce  a  notation  for  the  elements  of  Pm,  Pm  and  Am : 

p.=\\pS'\\%  i';=iipr*ii?.  ^„= II 4"" 111-        (»i) 

In  order  to  determine  Ai,  we  use  the  second  equation  in  (87).  This 
matrix  equation  can  be  replaced  by  the  scalar  equations 

(A;  -  A,  - 1)  4«  +  pS«  =i»<°>'         {i,k  =1,2 n)  (92) 

If  none  of  the  differences  A^-  —  X^  is  1,  we  can  set  P*=0.  We  then  have 
from  (872)  that  Ai  =  ^i(P_i  —  Po).*^ 

In  that  case  the  elements  of  Ai  are  uniquely  determined  from  (92)  : 

But  if  for  some^°  i,  k 

-^i  ■"  -^A  ~  1 » 

then  the  corresponding  p^c    is  determined  from  (92)  : 

and  the  corresponding  jcJJ   can  be  chosen  quite  arbitrarily. 
For  those  i  and  k  for  which  Xi  —  Xjc  y^l  we  set : 

pr=o, 

and  find  the  corresponding  x^^  from  (93). 

Having  determined  Ai,  we  next  determine  A2  from  the  third  equation 
of  (87) .    We  replace  this  matrix  equation  by  a  system  of  n^  scalar  equations : 

(A,-A,-2)  xg'=pg^*-p[^i-(P^,-A,PX  (94) 

{i,k=l,2,   ...     ,71). 

Here  we  proceed  exactly  as  in  the  determination  of  Ai. 


*9  "We  use  the  rotation  introduced  in  dealing  with  the  case  1. 
50  By  (89)  this  is  only  possible  for  i  <  fc. 
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If  li  —  X]cy^2,  then  we  set : 


and  find  from  (94)  : 


,(2) 


But  if  Ai  —  X]c  =  2,  then  it  follows  from  (94)  for  these  i  and  k  that : 

In  this  case  Xij^  is  chosen  arbitrarily. 

Continuing  this  process  we  determine  all  the  matrices  Pl^,  PJ  ,  PJ  ,  . . . 
and  Ai,  A2,  ...  in  succession. 

Furthermore,  only  a  finite  number  of  the  matrices  Pm  is  different  from 
zero  and,  as  is  easy  to  see,  P*(z)  is  of  the  form^^ 


P*  {z)  = 


-    a,,z 
0      i 

z 


Ai— ^2— 1 


«ln=^^'""^"~^ 


.  .  .   a^nZ^^-^""-^ 


0        0 


(95) 


where  a^fc  =  0,  when  k  —  kk  is  not  a  positive  integer,  and  aijc  =  Pik^~^^~^^*, 
when  Xi  —  Afc  is  a  positive  integer. 

We  denote  by  rrii  the  integral  part  of  the  numbers  Re  Xi  :^^ 


m,  =  [Re(A,)]         (i=l,2,  ...,  n). 


(96) 


Then,  by  (89), 


m^  ^  mg  ^  •  •  •  ^  w„ . 


If  Xi  — Xk  is  an  integer,  then 


^i  —  ^k  =  '^i  —  ^k' 


^^  Pm  (w  ^  0)  can  be  different  from  zero  only  when  there  exist  characteristic  values  Xi 
and  Xk  of  P-i  such  that  X^  —  Xk  —  l=^m  (and,  by  (89),  i<fc).  For  a  given  m  there 
corresponds  to  each  such  equation  an  element  fik^^^*  "^^  ^^^  ^^  ^^^  matrix  P^ ;  this  element 
may  be  different  from  zero.  All  the  remaining  elements  of  P*  are  zero. 

*2  I.e.,  mi  is  the  largest  integer  not  exceeding  Ke  Xi  (i=:  1,  2,  . . . ,  n). 
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Therefore  in  the  expression  (95)  for  the  canonical  matrix  P*{z)  we  can 
replace  all  the  differences  X^  —  X^  by  mt  —  mfc.     Furthermore,  we  set : 


M 


Then  it  follows  from  (95)  (see  formula  I  on  p.  134)  : 


^i<5i&|li, 

{i-- 
U= 

=  1,2,.. 

0        Ag 

...  ai„ 

•  •  •    «2n 

(91') 
(97) 

0      0 

...I 

-^^^z-^+^  =  D,  (z^z^) . 


P*(z)=z^— z-^  + 


(98) 


Hence  Y  =  z^z^  is  a  solution  of  (85)  and 

X=A{z)z'^z^ 

is  a  solution  of  (81).^^ 

3.  The  general  case.  As  we  have  explained  above,  we  may  replace  P_i 
without  loss  of  generality  by  an  arbitrary  similar  matrix.  We  shall  assume 
that  P_i  has  the  Jordan  normal  form^* 


with 


P-l  ={  ^1^1  +  Hif  ^2^2  +  ^2'   •  •  •>  K^u  +  ^« }  ' 


Re  (Ai)  ^  Re  (X.^)  ^  •  •  •  ^  Re  {XJ . 


(99) 


(100) 


Here  E  denotes  the  unit  matrix  and  H  the  matrix  in  which  the  elements  of 
the  first  superdiagonal  are  1  and  all  the  remaining  elements  zero.  The  orders 
of  the  matrices  Ei  and  Hi  in  distinct  diagonal  blocks  are,  in  general,  differ- 
ent ;  their  orders  coincide  with  the  degrees  of  the  corresponding  elementary 
divisors  of  P_i.^° 

In  accordance  with  the  representation   (99)   of  P_i  we  split  all  the 
matrices  Pm,  Pm,  Am  into  blocks : 


53  The  special  form  of  the  matrices  (97)  corresponds  to  the  canonical  form  of  P-i. 
If  P_i  does  not  have  the  canonical  form,  then  the  matrices  M  and  17  in  (98)  are  similar  to 
the  matrices  (97). 

54  See  Vol.  I,  Chapter  VI,  §  6. 

55  To  simplify  the  notation,  the  index  that  indicates  the  order  of  the  matrices  is  omitted 
from  Ei  and  Hi. 
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Pm  =  iPS'n,   P*m=  iP\rri^   ^m-  (XSYl- 

Then  the  second  of  the  equations  (87)  may  be  replaced  by  a  system  of 
equations 

{X,E,  +  H,)  Zg>  -  za>  [(X,  +  1)E,  +  H,]  +  PS'=PL"'*,  (101) 

which  can  also  be  written  as  follows : 

a,- A,- 1)  X<»  +  H,Xg^-X^H,  +  Pif  =  PT     (i.  fc=l>  2,  . . .,  u).     (102) 
Suppose  that^^ 


^11        ^12 


21       •*'22 


^     II       p(0)_||„(0)||      p(0)*_||      (C 
^st  II  »  -^iA   —  II  ^'i-t     II  >  ^ik      —  II  Vst 


(0)^ 


Then  the  matrix  equation  (102)   (for  fixed  i  and  k)  can  be  replaced  by  a 
system  of  scalar  equations  of  the  f  orm^'' 


{Xi—Xj,—  l)  x,t  +  a;,+i,  —  X,, «_i  +  ^f  =  jj^^ 


(103) 


(s=l,2,  ...,v\  t=l,2,  ...,w;  a:^+i, 


''5,0 


0), 


where  v  and  w  are  the  orders  of  the  matrices  AiJEJ^  +  Hi  and  AfojEJ^  +  J^fc  in  (99) . 
If  Xi  —  Xjcy^  1,  then  in  (103)  we  can  set  all  the  Pst-^^*  equal  to  zero  and 
determine  all  the  Xst  uniquely  from  the  recurrence  relations  (103).     This 
means  that  in  the  matrix  equations  (102)  we  set 


>(0)=' 


0 


and  determine  X^jJ  uniquely. 

If  Ai  —  Afc  =  l,  then  the  relations  (103)  assume  the  form 


(0)        (oy 


(104) 


iXv+i,t  =  Xs,o  =  0;  5=1,2,  ...,  v;  ^  =  1,2,  . . .,  w). 


56  To  simplify  the  notation,  we  omit  the  indices  i,  fc  in  the  elements  of  the  matrices 

XT        p(0)  p(0)^ 

57  The  reader  should  bear  in  mind  the  properties  of  the  matrix  H  that  were  developed 
on  pp.  13-15  of  Vol.  I. 
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It  is  not  difficult  to  show  that  the  elements  x^t  of  Xji^  can  be  determined 
from  (104)  so  that  the  matrix  Pj°^*has,  depending  on  its  dimensions  {vY^w), 
one  of  the  forms 


ao     0 


0  0  .  .  .  0 
0  0  .  .  .  0 


o»_i     Ov-2  •  •  •     ^1     ^0  ^v-\  «i     ao  0  .  .  .  0 

[v  —  w)  (v  <  w) 

0      0     ...    0 


0  0  .  .  .  0 
do  0  ...  0 
a-i     an    ...    0 


"X      "^0 


(105) 


a«,_i  .  .  .    «!     «( 
(t7>iy) 

We  shall  say  of  the  matrices  (105)  that  they  have  the  regular  lower 
triangular  form.^^ 

From  the  third  of  the  equations  (87)  we  can  determine  A2.  This  equa- 
tion can  be  replaced  by  the  system 

{X,-X,^2)Xf  +  H,Xf-j'SH,  +  {P,A^-AJ>,},,  +  P'}^=P':,'*      (106) 

{i,k=l,2,  ...,  u). 

In  the  same  way  that  we  determine  Ai,  we  determine  Xl^  uniquely  with 
p(y*  =  0  from  ( 106 )  provided  Xi  —  Xu¥-2.  But  if  Xi  —  h  =  2,  then  Xl?  can 
be  determined  so  that  Pii*  is  of  regular  lower  triangular  form. 


.(1) 


5^  Regular  upper  triangular  matrices  are  defined  similarly.  The  elements  of  Zik  are 
not  all  uniquely  determined  from  (104)  ;  there  is  a  certain  degree  of  arbitrariness  in  the 
choice  of  the  elements  x,t.  This  is  immediately  clear  from  (102):  for  Xi  —  X*  =r  1  we 
may  add  to  JCik  an  arbitrary  matrix  permutable  with  H,  i.e.,  an  arbitrary  regular  upper 
triangular  matrix. 


156   XIV.  Applications  to  Systems  of  Linear  Differential  Equations 

Continuing  this  process,  we  determine  all  the  coefficient  matrices  Ai, 
A2, . . .  and  P*i,  Pq  ,  P*, ...  in  succession.  Only  a  finite  number  of  the  coef- 
ficients Pm  is  different  from  zero,  and  the  matrix  P*(z)  has  the  following 
block  form  :^® 


^  ^  ^U^U  +  Hj, 


where 

iO  if  Xi  —  Ijc  is  not  a  positive  integer, 

p(k-i^-i)*     .^  ^_  _  Afc  is  a  positive  integer. 

All  the  matrices  Bi^  {i,k  =  l,  2,  . . . ,  u;  i  <CJc)  are  of  regular  lower  trian- 
gular form. 

As  in  the  preceding  case,  we  denote  by  nii  the  integral  part  of  Re  Xi 

m,=  [Re(A,)]  (i=l,2,  . . ,  ,  u)  (108) 

and  we  set 

A,  =  m,  +  A7  (*=1,2,  ...  ,  w).  (108') 

Then  in  the  expression  (107)  for  P^(z)  we  may  again  replace  the  difference 
^i  —  h  everywhere  by  m^  —  nik-  If  we  introduce  the  diagonal  matrix  M 
with  integer  elements  and  the  upper  triangular  matrix  Z7  by  means  of  the 
equations®^ 

O  l^E^  +  if  2  •  •  • 

0  0         ..  . 

then  we  easily  obtain,  starting  from  (107),  the  following  representation  of 

P*(2r): 


^^  The  dimensions  of  the  square  matrices  Ei,  Hi  and  the  rectangular  matrices  Bik  are 
determined  by  the  dimensions  of  the  diagonal  blocks  in  the  Jordan  matrix  P-i,  i.e.,  by 
the  degrees  of  the  elementary  divisors  of  P-i. 

®°  Here  the  splitting  into  blocks  corresponds  to  that  of  P-i  and  P*(z). 
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Hence  it  follows  that  the  solution  (85)  can  be  given  in  the  form 

and  the  solution  of  (81)  can  be  represented  as  follows: 

X  =  ^(z)2^z^.  (110) 

Here  A{z)  is  the  matrix  series  (84),  M  is  a  constant  diagonal  matrix  whose 
elements  are  integers,  and  U"  is  a  constant  triangular  matrix.  The  matrices 
M  and  Z7  are  defined  by  (108),  (108'),  and  (109).^^ 

3.    We  now  proceed  to  prove  the  convergence  of  the  series 

We  shall  use  a  lemma  which  is  of  independent  interest. 
Lemma  :    If  the  series^^ 

X  =  aQ  +  a^z  +  a^z^ -i (111) 

formally  satisfies  the  system 

^,=P{z)x  (112) 

for  which  z  =  Ois  a  regular  singularity,  then  (111)  converges  in  every  neigh- 
borhood of  z^O  in  which  the  expansion  of  the  coefficient  matrix  P{z)  in 
the  series  (82)  converges. 
Proof.    Let  us  suppose  that 

s-o 

oo 

where  the  series   2J  P"»  Zm  converges  for  \  z\  <r.     Then  there  exist  posi- 

OT=0 

tive  constants  p_i  and  p  such  that®^ 

modP_i^?)_i/,  modP^^^/,     -^=||1||         (m=0,l,2,  ...).    (113) 

Substituting  the  series  (111)  for  x  in  (112)  and  comparing  the  coeffi- 
cients of  like  powers  on  both  sides  of  (112),  we  obtain  an  infinite  system  of 
(column)  vector  equations 


61  See  footnote  53. 

®2  Here  x=:  (xi,  Xs,  .  . . ,  Xn)  is  a  column  of  unknown  functions;  00,01,02,...  are  con- 
stant columns;  T{z)  is  a  square  coefficient  matrix. 

63  For  the  definition  of  the  modulus  of  a  matrix,  see  p.  128. 
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{2E-  P_i)  a^  =  Po«i  +  ^i«0' 


{mE  —  P_i)  a^  =  Po«m-l  +  Pl<^m--  +  '"  +  ^m-l«0 


(114) 


It  is  sufficient  to  prove  that  every  remainder  of  the  series  (111) 

a;W  =a^2*  +  %+iZ*+i  +  •  •  •  (115) 

converges  in  a  neighborhood  of  z  =  0.  The  number  k  is  subject  to  the 
inequality 

k  >  np-i. 

Then  k  exceeds  the  moduli  of  all  the  characteristic  values  of  P_i,®*  so  that 
f or  m  ^  A;  we  have  |  mE  —  P_i  |  y^  0  and 

(^^_P_,)-=1(s_%1)-^  =  1b  +  1,P_,  +  ;^P1.  +  -    (116) 

{m=k,k  +  l,  ...). 

In  the  last  part  of  this  equation  there  is  a  convergent  matrix  series.  "With 
the  help  of  this  series  and  by  using  (114),  we  can  express  all  the  coefficients 
of  (115)  in  terms  of  ao,  ai,  .  .  .  ,  a^-i  by  means  of  the  recurrence  relations 

a^  =  [^E-\-^,  P_l  +  ^  PLi  +  •  •  •)   Um-l  +  Po^m-l  +  •  •  •  +  Pm-l:-ia,), 

{m  =  k,k  +  \,  ...)  (117) 

where 

L-l  =  Pm-lcO'k-l  +  •  •  •  +  Pm-lO'O  {m=  k,  k  +  I,  .  .  .) .  (118) 

Note  that  this  series  (115)  formally  satisfies  the  differential  equation 

^*  If  Xo  is  a  characteristic  value  of  A=  \\aik  \\l,  then  |  Xo  |  ^  7i»max  |  aik  |.     For  let 
Ax  =  XoX,  where  x  =  (xi,  Xz,  .  .  . ,  Xn)  ^  0.    Then  ~  '    ~ 

n 
Vi  =^  (^ikXk         (*  —  1,  2,  ...  n). 
k=l 

Let  \xi\=z  max  {\x^\,  \x2\,  ... ,  \xn\}.     Then 

n 

1  ^0  I  I  «;•  I  ^  ^  I  «/*  I  \xic\^\xj\n     max     [  atk  \ . 

k^l  1  ^  t,  A:  ^  n 

Dividing  through  \xi\,  we  obtain  the  required  inequality. 
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dz 


P{z)  a:(*)  +  /  {z), 


where 


f{z)=    2J  fn/"  =P(^)  (ao  +  aiz  +  •  •  •  +  a,_^z'-')  - 

—  ai  —  2  a^z (^  —  1)  a^-iz^~^ 

From  (120)  it  follows  that  the  series 


159 
(119) 

(120) 


m=k—l 

converges  for  \  z\  <  r ;  hence  there  exists  an  integer  iV  >  0  such  that^ 


mod  /^  ^ 


{m=k—l,k,  . . .). 


(121) 


From  the  form  of  the  recurrence  relations  (117)  it  follows  that  when  the 
matrices  P-i,  Pq,  fm-i  in  these  relations  are  replaced  by  the  majorant 
matrices  p-il,  pr-^I,  ||  ^;^j^  ||  and  the  column  am  by  am  (m  =  k,  k  +  1, 
...  ;  ^  =  0,  1,  2,  . .  .),^^  then  we  obtain  relations  that  determine  upper  bounds 
II  am  II  for  moda^: 

(122) 


moda^^lja^ 


Therefore  the  series 


p)  =  ap*  +  a^_^^2*+i 


(123) 


after  term-by-term  multiplication  with  the  column  ||  1  ||  becomes  a  majorant 
series  for  (115). 

By  replacing  in  (119)  the  matrix  coefficients  P_i,  Pq,  fm  of  the  series 


p{z)=-f^+2:  p,z^,    f{z)=  2:  Lz' 


q=0 


m=k—l 


by  the  corresponding  majorant  matrices  p_-^  I,  ^  /, 
a  differential  equation  for  |(*) : 


N_ 
rm 


zk-\ 


dz  \  ^         1  — zr~^j  *  2 


!<*)  |L  we  obtain 


(124) 


65  Here  1|  N/r^  \\  denotes  the  column  in  which  all  the  elements  are  equal  to  one  and  the 
same  number,  N/r***. 

66  Here    \\  am  \\    denotes   the   column    (am,  am,  ...,  am)    (am    is    a    constant,    m  =  lc, 

fc  +  1,  ...). 
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This  linear  differential  equation  has  the  particular  solution 

0 

which  is  regular  for  z  =  0  and  can  be  expanded  in  a  neighborhood  of  this 
point  in  the  power  series  (123)  which  is  convergent  for  |  0  |  <  r. 

From  the  convergence  of  the  majorant  series  (123)  it  follows  that  the 
series  (115)  is  convergent  for  |  2;  |  <  r,  and  the  lemma  is  proved. 

Note  1.  This  proof  enables  us  to  determine  all  the  solutions  of  the  differ- 
ential system  (112)  that  are  regular  at  the  singular  point,  provided  such 
solutions  exist. 

For  the  existence  of  regular  solutions  {not  identically  zero)  it  is  necessary 
and  sufficient  that  the  residue  matrix  P_i  have  a  non-negative  integral  char- 
acteristic value.  If  s  is  the  greatest  integral  characteristic  value,  then  columns 
ao,  ai,  .  . .  ,as  that  do  not  all  vanish  can  be  determined  from  the  first  5  +  1  of 
the  equations  (114)  ;  for  the  determinant  of  the  corresponding  linear  homo- 
geneous equation  is  zero : 

A=\P_^\\E-P_i\"-\sE-P_i\  =  0. 

From  the  remaining  equations  of  (114)  the  columns  as+i,  as+2,  ■  -  -  can  be 
expressed  uniquely  in  terms  of  Go,  ai,  . .  . ,  ag.  The  series  (111)  so  obtained 
converges,  by  the  lemma.  Thus,  the  linearly  independent  solutions  of  the 
first  5  +  1  equations  (114)  determine  all  the  linearly  independent  solutions 
of  the  system  (112)  that  are  regular  at  the  singular  point  0  =  0. 

If  0  =  0  is  a  singular  point,  then  a  regular  solution  (111)  at  that  point 
(if  such  a  solution  exists)  is  not  uniquely  determined  when  the  initial  value 
ao  is  given.  However,  a  solution  that  is  regular  at  a  regular  singularity  is 
uniquely  determined  when  ao,  ai,  .  .  . ,  ag  are  given,  i.e.,  when  the  initial 
values  at  0  =  0  of  this  solution  and  the  initial  values  of  its  first  s  derivatives 
are  given  (s  is  the  largest  non-negative  integral  characteristic  value  of  the 
residue  matrix  P_i). 

Note  2.  The  proof  of  the  lemma  remains  valid  for  P^i  =  0.  In  this 
case  an  arbitrary  positive  number  can  be  chosen  for  p_i  in  the  proof  of  the 
lemma.  For  P_i  =  0  the  lemma  states  the  well-known  proposition  on  the 
existence  of  a  regular  solution  in  a  neighborhood  of  a  regular  point  of  the 
system.  In  this  case  the  solution  is  uniquely  determined  when  the  initial 
value  do  is  given. 

4.    Suppose  given  the  system 

~=P{z)X,  (126) 
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where 

m=0 

and  the  series  on  the  right-hand  side  converges  for  |  0  |  <  r. 
Suppose,  further,  that  by  setting 

X=A{z)Y  (127) 

and  substituting  for  Aiz)  the  series 

A{z)  =  Ao  +  A^z  +  A2Z^  +  '"   ,  (128) 

we  obtain  after  formal  transformations: 

%  =  P'(z)Y,  (129) 

where 

7W=0 

and  that  here,  as  in  the  expression  for  P{z),  the  series  on  the  right-hand 
side  converges  for  |  0  |  <  r. 

We  shall  show  that  the  series  (128)  also  converges  in  the  neighborhood 
I  0  I  <  r  of  0  =  0. 

Indeed,  it  follows  from  (126),  (127),  and  (129)  that  the  series  (128) 
formally  satisfies  the  following  matrix  differential  equation 

t1  A 

~=P{z)A-AP*{z).  (130) 

We  shall  regard  A  as  a  vector  (column)  in  the  space  of  all  matrices  of 
order  n,  i.e.,  a  space  of  dimension  n^.  If  in  this  space  a  linear  operator 
P{z)  on  A,  depending  analytically  on  a  parameter  z,  is  defined  by  the 
equation 

P{z)[A'\=P{z)A-AP*{z),  (131) 

then  the  differential  equation  (130)  can  be  written  in  the  form 

dA 


dz 


P{z)[A].  (132) 


The  right-hand  side  of  this  equation  can  be  considered  as  the  product  of 
the  matrix  r  (z)  of  order  n^  and  the  column  A  of  n^  elements.  From  (131) 
it  is  clear  that  2;  =  0  is  a  regular  singularity  of  the  system  (132) .  The  series 
(128)  formally  satisfies  this  system.  Therefore,  by  applying  the  lemma,  we 
conclude  that  (128)  converges  in  the  neighborhood  |  2;  |  <  r  of  2;  =  0. 
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In  particular,  the  series  for  A{z)  in  (110)  also  converges. 
Thus,  we  have  proved  the  following  theorem : 

Theorem  2.    Every  system 

^=P(z)X,  (133) 

with  a  regular  singularity  at  z^O 

has  a  solution  of  the  form 

X  =  A{z)z'^z^  ,  (134) 

where  A{z)  is  a  matrix  function  that  is  regular  for  z  =  0  and  becomes  the 
unit  matrix  E  at  that  point,  and  where  M  and  U  are  constant  matrices,  M 
being  of  simple  structure  and  having  integral  characteristic  values,  whereas 
the  difference  between  any  two  distinct  characteristic  values  of  U  is  not  an 
integer. 

If  the  matrix  P_i  is  reduced  to  the  Jordan  form  by  means  of  a  non- 
singular  matrix  T 

P_^=T  {Ai  J^i  +  H^,  X^E^  +  H^,.,.,  X,E,  +  H,}  T-^  (135) 

(Re  (Ai)  ^  Re  {X^)  ^  •  •  •  ^  Re  (A,)) , 


(136) 

T-i,  (137) 

0  O  ...   KE,  +  H,i 

where 

»^i=W,     li  =  k-m,        {i=l,2,...,s),  (138) 

The  Bijc  are  regular  lower  triangular  matrices  {i,k  =  l,2, . . .  ,  s)  and  Biu  =  0 
if  Xi  —  X]cis  not  a  positive  integer  (i,k  =  l,2,  . . . ,  s). 

In  the  particular  case  where  none  of  the  differences  Xi  —  X^  (i,  k  =  l,  2, 
d,  . . . ,  s)  is  a  positive  integer,  we  can  set  in  (134)  M  =  0  and  U  =  P-i;  i.e., 
in  this  case  the  solution  can  be  represented  in  the  form 

X=A{z)z^-K  (139) 


then  M  and  U 

can 

be  chosen  in 

the  form 

_ 

M=T[m^. 

E^,  m^E^,  . . 

,m. 

E.) 

T-\ 

AA  +  ^1 

Sx2 

Bu 

U- 

=  T 

".. 

^E^  +  H^ 

... 

S,s 
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Nate  1.    We  wish  to  point  out  that  in  this  section  we  have  developed  an 

oo 

algorithm  to   determine   the   coefficients  of   the   series   A{z)  =  ^  Am^"^ 

m=0 

(^0  =  ^)  in  terms  of  the  coefficients  Pm  of  the  series  for  P(z).  Moreover, 
the  theorem  also  determines  the  integral  substitution  V  by  which  the  solution 
(134)  is  multiplied  when  a  circuit  is  made  once  in  the  positive  direction 
around  the  singular  point  z  =  0: 

y—-^2nin^ 

Note  2.    From  the  enunciation  of  the  theorem  it  follows  that 


B,,  =  0     for    A,5^A,        (i,fc  =  l,2,... 

,s). 

Therefore  the  matrices 

/O   B,, 

. . .  B,; 

A=  T {IijEJi,  l^E^,  ...,  Ie,}  T-1     and    U=tI^   ^ 

...^2. 

\o  0 

...0    , 

are  permutable : 

AU=UA. 
Hence 

^M^U  =  ^M^A+  U  ^  ^M^A^V  =  ^A^U^ 

where 

(140) 


(141) 
A=M  +  A  =  T  {Ai,  A2,  . . . ,  AJ  T-i  (142) 

and  where  Ai,  A2,  .  .  . ,  A„  are  all  the  characteristic  values  of  P_i  arranged 
in  the  order  Re  Ai  ^  Re  A2  ^  . .  •  ^  Re  A„. 
On  the  other  hand, 

where  /i( A)  is  the  Lagrange-Sylvester  interpolation  polynomial  iorf{X)=z^. 
Since  all  the  characteristic  values  of  V  are  zero,  h{X)  depends  linearly 
on  /(O),  /'(O),  .  .  .  ,  /(^-i)(0),  i.e.,  on  1,  In  ^,  ... ,  (In  0)^-1  {g  is  the  least 
exponent  for  which  fj9=0).    Therefore 

and 

(1      ?12  •  •  •   ^h 
0       1        .  .  .    g2n  j   ^T-l  ^  (143) 

0     0     ...  1 
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where  Qh  (i,j  =  l,  2,  .  .  .  ,  n;  i  <  j)  are  polynomials  in  In  z  of  degree  less 
than  g. 

By  (134),  (141),  (142),  and  (143)  a  particular  solution  of  (126)  can  be 
chosen  in  the  form 


X  =  A{z) 


^'     0    ...  0 
I      z'^  ...0 


0       0 


1 

?12   • 

••    qin 

0 

1    . 

••    ?2n 

0 

0     . 

..    1 

(144) 


Here  Ai,  A2,  •  •  • ,  >^n  are  the  characteristic  values  of  P_i  arranged  in  the  order 
Re  Ai  ^  Re  A2  ^  .  . .  ^  Re  ^n,  and  qij  {i,  j  =  l,  2,  .  .  . ,  n;  i  <  j)  are  poly- 
nomials in  In  z  of  degree  not  higher  than  g  —  1,  where  g  is  the  maximal  num- 
ber of  characteristic  values  Xi  that  differ  from  each  other  by  an  integer ;  A{z) 
is  a  matrix  function,  regular  at  ^  =  0,  and  A{0)  =  T  {\  T  \  y^  0) .  If  P_i  has 
the  Jordan  form,  then  T  =  E. 


§11.    Reducible  Analytic  Systems 


1.    As  an  application  of  the  theorem  of  the  preceding  section  we  shall  investi- 
gate in  what  cases  the  system 


dX 
dt 


Q(t)X, 


where 


(145) 


(146) 


is  a  convergent  series  for  t  >  to,  is  reducible  (in  the  sense  of  Lyapunov),  i.e., 
in  what  cases  the  system  has  a  solution  of  the  form 


X  =  L(t)e^', 


(147) 


where  L{t)  is  a  Lyapunov  matrix  (i.e.,  L(t)  satisfies  the  conditions  1.-3.  on 
p.  117)  and  J5  is  a  constant  matrix.®^     Here  X  and  Q  are  matrices  with 
complex  elements  and  t  is  a,  real  variable. 
We  make  the  transformation 

1 
^  =  T' 

^"^  If  the  equation  (147)  holds,  then  the  Lyapunov  transformation  X  =  L(t)Y  carries 

dr 


the  system  (145)  into  the  system 


BY. 
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Then  the  system  ( 145 )  assumes  the  form 

dX 


dz 


=  P{z)X, 


where 


P{z)  =  -z-^q[^) 


r-IJQ. 


m+2 


Z     . 
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(148) 


(149) 


m=0 


The  series  on  the  right-hand  side  of  the  expression  for  P{z)  converges  for 
I  z  I  <l/#o-    Two  cases  can  arise : 

1 )  Qi  =  0.  In  that  case  0  =  0  is  not  a  singular  point  of  the  system  ( 148 ) . 
The  system  has  a  solution  that  is  regular  and  normalized  at  0  =  0.  This  solu- 
tion is  given  by  a  convergent  power  series 


X{z)  =  E  +  X^z  +  X2Z^  + 


Setting 


i<i) 


we  obtain  the  required  representation  (147) .    The  system  is  reducible. 

2)  Qi  =7^  0.  In  that  case  the  system  (148)  has  a  regular  singularity  at 
0  =  0. 

Without  loss  of  generality  we  may  assume  that  the  residue  matrix 
P_i  =  —  Qi  is  reduced  to  the  Jordan  form  in  which  the  diagonal  elements 
h,  ^2,  •  •  • ,  ^  are  arranged  in  the  order  Re  Ai  ^  Re  A2  ^  . . .  ^  Re  An. 

Then  in  (144)  T  =  E,  and  therefore  the  system  (148)  has  the  solution 


X  =  A{z) 


z^^     0    ...  0 
0      z^«  . . .  0 

0       0    . . .  z^' 


1 

^12   • 

••   9'ln 

0 

1    . 

••    ^2» 

0 

0     . 

..    1 

where  the  function  A{z)  is  regular  for  0  =  0  and  assumes  at  this  point  the 
value  E,  and  where  qiu  {1,^  =  1,  2,  .  .  . ,  u)  i  <:h)  are  polynomials  in  In  z. 
When  we  replace  z  by  l/#,  we  have : 


X  =  A 


t  J 
0 

0     . 

..     0 
..     0 

1 
0 

gi2(ln|)  . 
1 

.?in(ln|) 

.?2n(ln|) 

0 

0     . 

0 

0 

1 

(150) 
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Since  X  =  A(l/t)Y  is  a  Lyapunov  transformation,  the  system  (145)  is 
reducible  to  a  system  with  constant  coefficients  if  and  only  if  the  product 


L^{t)  = 


-h 

0     . 

..    0 

I 

^12(i^t)    • 

•«.»(ln|) 

0 

t-'^ . 

..    0 

0 
0 

1 

•  ?2„(ln|) 

0 

0     . 

.  .    t-'n 

0 

1 

J5t 


(151) 


where  5  is  a  constant  matrix,  is  a  Lyapunov  matrix,  i.e.,  when  the  matrices 
Li{t),-T^ ,  and  L~'^(t)  are  bounded.    It  follows  from  the  theorem  of  Erugin 

(§4)  that  the  matrix  B  can  be  assumed  here  to  have  real  characteristic 
values. 

Since  Li(t)  and  L~^it)  are  bounded  for  t  >  to,  all  the  characteristic 
values  of  B  must  be  zero.  This  follows  from  the  expression  for  e^^  and  e~^* 
obtained  from  (151).  Moreover,  all  the  numbers  Ai,  A2,  .  .  . ,  An  must  be  pure 
imaginary,  because  by  (151)  the  fact  that  the  elements  of  the  last  row  of 
Li{t)  and  of  the  first  column  of  L~'^(t)  are  bounded  implies  that  Re  ^^^  ^  0 
and  Re  Xi  ^  0. 

But  if  all  the  characteristic  values  of  P_i  are  pure  imaginary,  then  the 
difference  between  any  two  distinct  characteristic  values  of  P_i  cannot  be 
an  integer.    Therefore  the  formula  (139)  holds 


X  =  A{z)z^-^=A{^^)t<i^, 


and  for  the  reducibility  of  the  system  it  is  necessary  and  sufficient  that 
the  matrix 


L^{t)  =  t^^e- 


(152) 


together  with  its  inverse  be  bounded  for  t  >  to. 

Since  all  the  characteristic  values  of  B  must  be  zero,  the  minimal  poly- 
nomial of  B  is  of  the  form  X'^.    We  denote  by 


V^(A)=(A-//i)^^(A- 


i"2> 


'« •  •  •  {X  —  jLiJ^  {fii  j^fijcioriy^k) 


the  minimal  polynomial  of  Qi.  As  Qi  =  —  P-iy  the  numbers  /xi,  fi2,  .  .  .  ,  fiu 
differ  only  in  sign  from  the  corresponding  numbers  Xi  and  are  therefore  all 
pure  imaginary.    Then  (see  the  formulas  (12),  (13)  on  p.  116) 
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fi'  =  2:iU,,+  U,,lnt  +  .--+  U,,  ,^_i  (In  ()'*-']  f",  (153) 

A;=l 

e^'  =  Fo  +  Fi<  +  . . .  +  F^_3  ^-1 .  (154) 

Substituting  these  expressions  in  the  equation 

we  obtain 

[L^{t)  V,_,  +  (*)]  t'-'=  Z,{t)  (In  ty-' ,  (155) 

where  c  is  the  greatest  of  the  numbers  Ci,  C2,  .  .  . ,  Cn,  (*)  denotes  a  matrix 
that  tends  to  zero  for  ^  — >  00,  and  Zo{t)  is  a  bounded  matrix  for  t  >  to. 

Since  the  matrices  on  both  sides  of  (155)  must  be  of  equal  order  of  magni- 
tude for  t-^  00 ,  we  have 

d  =  c=l, 
i.e., 

B  =  0, 

and  the  matrix  Qi  has  simple  elementary  divisors. 

Conversely,  if  Qi  has  simple  elementary  divisors  and  pure  imaginary 
characteristic  values  /xi,  112, ... ,  fXn,  then 

X  =  A{z)z-^^^A{z)\\z-'id,,\\l 

is  a  solution  of  (149).    Setting  z  =  1/t,  we  find: 

The  function  X{t)  as  well  as  — jp^  and  the  inverse  matrix  X~'^{t)  are 

bounded  for  t  >  to.    Therefore  the  system  is  reducible  {B  =  0).    Thus  we 
have  proved  the  following  theorem :®® 

Theorem  3 :    The  system 
where  the  matrix  Q{t)  can  he  represented  in  a  series  convergent  for  t  ^  to 

is  reducible  if  and  only  if  all  the  elementary  divisors  of  the  residue  matrix 
Qi  are  simple  and  all  its  characteristic  values  pure  imaginary. 

^8  See  Erugin  [13].     The  theorem  is  proved  for  the  case  where  Qi  does  not  have  dis- 
tinct characteristic  values  that  differ  from  each  other  by  an  integer. 
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§  12.     Analytic  Functions  of  Several  Matrices  and  their 

Application  to  the  Investigation  of  Differential  Systems. 

The  Papers  of  Lappo-Danilevskii 

1.  An  analytic  function  of  m  matrices  Xi,  X2,  . . . ,  Xm  of  order  n  can  be 
given  by  a  series 

00       (l,...,m) 

F (Xi,  Xg,  . . . ,  Xj^  =ocq  +  ^       2J     ocj^j^ ,,,j^Xj^Xj^ ' ' ' Z;v       (156) 

convergent  for  all  matrices  Xj  of  order  n  that  satisfy  the  inequality 

modX^.<i2,.       {j=l,2,  ...,m).  (157) 

Here  the  coefficients 

ao'  ah7,.../v      (h>?2>  ...,?;=!,  2,  ...,m;  v=l,2,  3,  ...) 

are  complex  numbers,  Bj  (j  =  l,  2,  . . . ,  m)  are  constant  matrices  of  order  n 
with  positive  elements,  and  Xj  (i  =  l,  2,  . . . ,  m)  are  permutable  matrices 
of  the  same  order  with  complex  elements. 

The  theory  of  analytic  functions  of  several  matrices  was  developed  by 
I.  A.  Lappo-Danilevskii.  He  used  this  theory  as  a  basis  for  fundamental 
investigations  on  systems  of  linear  differential  equations  with  rational 
coefficients. 

A  system  with  rational  coefficients  can  always  be  reduced  to  the  form 

after  a  suitable  transformation  of  the  independent  variable,  where  Vju  are 
constant  matrices  of  order  n,  aj  are  complex  numbers,  and  Sj  are  positive 
integers  (Jc  =  0,  1,  .  . . ,  Sy_i ;  j  =  1,  2,  .  . . ,  m) .^^ 

We  shall  illustrate  some  of  Lappo-Danilevskii 's  results  in  the  special  case 
of  the  so-called  regular  systems.  The  latter  are  characterized  by  the  condi- 
tion Si  =  $2=  . .  .  =  Sm  =  l  and  can  be  written  in  the  form 

i^=  yJh^X.  (159) 

dz       ^  z  —  aj  ^ 

1=1  I 


*^  In  the  system  (158)  all  the  coefficients  are  regular  rational  fractions  in  z.  Arbi- 
trary rational  coefficients  can  be  reduced  to  this  form  by  carrying  a  finite  point  2  =  c 
that  is  regular  (for  all  coefficients)  by  means  of  a  fractional  linear  transformation  on  z 
into  0  =  00  . 
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Following  Lappo-DanilevskiT,  we  introduce  special  analytic  functions, 
namely  hyperlogarithms,  which  are  defined  by  the  following  recurrence 
relations : 


h{z;a.j,)=j^—, 

b 

k(z;  aj^,  aj„  .. .,  ayj  =/ 


b 

z 


dz 

z  —  aj, 

b 

Regarding  ai,  a2,  .  . . ,  am,  oo  as  branch  points  of  logarithmic  type,  we 
construct  the  corresponding  Riemann  surface  S{ai,  a2f  . . . ,  dm',  ^)-  Every 
hyperlogarithm  is  a  single-valued  function  on  this  surface.  On  the  other 
hand,  the  matricant  Q^  of  the  system  (159)  (i.e.,  the  solution  normalized  at 
0  =  6)  after  analytic  continuation  can  also  be  regarded  as  a  single-valued 
function  on  S{ax,  a2, .  . . ,  am;  ^)  '-,  here  h  can  be  chosen  as  an  arbitrary  finite 
point  on  S  other  than  a\,  a^,  .  .  . ,  Om- 

For  the  normalized  solution  Ql  Lappo-Danilevskii  gives  an  explicit  ex- 
pression in  terms  of  the  defining  matrices  TJi,  U2,  ■  .  .  ,  Um  of  (159)  in  the 
form  of  a  series 

00     (l,...,m) 

Ol=E  +  2;     2;     h{z;au,aj,,...,aj;)U^,Ui,'"Uj,.  (160) 

"=1  7i iv 

This  expansion  converges  uniformly  in  z  for  arbitrary  Ui,  TJ2,  •  •  - ,  Um  and 
represents  Ql  in  any  finite  domain  on  S(ai,  a2,  ...,  am;  <» )  provided  only 
that  the  domain  does  not  contain  ai,  a2,  .  .  . ,  am  in  the  interior  or  on  the 
boundary. 

If  the  series  (156)  converges  for  arbitrary  matrices  Xi,  X2, .  .  . ,  Xm,  then 
the  corresponding  function  F(Xi,  X2,  .  .  . ,  Xm)  is  called  entire.  Ql  is  an 
entire  function  of  the  matrices  Z7i,  Z72,  .  . . ,  TJm- 

If  in  (160)  we  let  the  argument  z  go  around  the  point  a,  once  in  the  posi- 
tive direction  along  a  contour  that  does  not  enclose  other  points  ai  (for 
i^  j),  then  we  obtain  the  expression  for  the  integral  substitution  Vj  corre- 
sponding to  the  point  z  =  aj: 

V^  =^E+^    2'"  V}  (&;  «fP  «/.'  •  •  V  a,v)  ^H^i.  .  • .  C^;.  (161) 

♦•=1  iu.-->iv 

{j  =  \,2,  ...,  m), 
where  in  a  readily  understandable  notation 
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dz 


Pj{b;ajJ=j 


z  —  aj, 
Pj  (6;  ttj^,  ttj,,  ...,  a.jj)  —j  ^^^j^T dz 

(aj) 

1^  =  1,2,3,... 

The  series  (161),  like  (160),  is  an  entire  function  of  Z7i,  TJ2,  - . . ,  Um- 

2.  Generalizing  the  theory  of  analytic  functions  to  the  case^°  of  a  countably 
infinite  set  of  matrix  arguments  Xi,  X2,  X3,  . . . ,  Lappo-Danilevskii  has 
used  it  to  study  the  behavior  of  a  solution  of  a  system  in  a  neighborhood 
of  an  irregular  singularity."^^    We  quote  the  basic  result. 

The  normalized  solution  Ql  of  the  system 

where  the  power  series  on  the  right-hand  side  converges  for  1 2f  |  <  r  (r  >  1)  ,''2 
can  be  represented  by  a  series 


Ql=E+ 2:        i:      P,r--Pi,y. 

'-i   '■•'■' '•■ — 1  (162) 


Here   ocf^^^         .    and  a^!"^  are  scalar  coefficients  that  are  defined  by 

special  formulas.    The  series  (162)  converges  for  arbitrary  matrices  Pi,  P2, 
...  in  an  annulus 

Q<\z\<r 

(q  is  any  positive  number  less  than  r).     The  point  h  must  also  lie  in  this 
annulus  (q  <  Ih  \  <i  r). 


70  See  [29],  Vol.  I,  Memoir  1. 

71  See  [29],  Vol.  I,  Memoir  3.    See  also  [252],  [253],  [254],  [146],  and  [147]. 

72  The  restriction  r  >  1  is  not  essential,  since  this  condition  can  always  be  obtained  by 
replacing  z  by  az,  where  a  is  a  suitably  chosen  positive  number. 
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Since  in  this  book  we  cannot  possibly  describe  the  contents  of  the  papers 
of  Lappo-Danilevskii  in  sufficient  detail,  we  have  had  to  restrict  ourselves 
to  giving  above  statements  of  a  few  basic  results  and  we  must  refer  the  reader 
to  the  appropriate  literature. 

All  the  papers  of  Lappo-Danilevskii  that  deal  with  differential  equations 
have  been  published  posthumously  in  three  volumes  ( [29]  :  Memoir es  sur  la 
theorie  des  systemes  des  equations  differentielles  lineaires  (1934-36) ).  More- 
over, his  fundamental  results  are  expounded  in  the  papers  [252],  [253], 
[254]  and  the  small  book  [28].  A  concise  exposition  of  some  of  the  results 
can  also  be  found  in  the  book  by  V.  I.  Smirnov  [56],  Vol.  III. 


CHAPTER    XV 

THE  PROBLEM  OF  ROUTH-HURWITZ  AND  RELATED  QUESTIONS 

§  1.     Introduction 

In  Chapter  XIV,  §  3  we  explained  that  according  to  Lyapnnov's  theorem 
the  zero  solution  of  the  system  of  differential  equations 

|*=Ja,A+(**)  (1) 

(Oik  ih  ^  = '^,  2,  .  .  .  ,  n)  are  constant  coefficients)  with  arbitrary  terms  (**) 
of  the  second  and  higher  orders  in  Xi,  X2,  .  .  . ,  Xn  is  stable  if  all  the  character- 
istic values  of  the  matrix  A  =  11  o^fc  lli ,  i-e.,  all  the  roots  of  the  secular  equa- 
tion A{X)  ^\XE  —  A  1=0,  have  negative  real  parts. 

Therefore  the  task  of  establishing  necessary  and  sufficient  conditions 
under  which  all  the  roots  of  a  given  algebraic  equation  lie  in  the  left  half- 
plane  is  of  great  significance  in  a  number  of  applied  fields  in  which  the 
stability  of  mechanical  and  electrical  systems  is  investigated. 

The  importance  of  this  algebraic  task  was  clear  to  the  founders  of  the 
theory  of  governors,  the  British  physicist  J.  C.  Maxwell  and  the  Russian 
scientific  research  engineer  I.  A.  Vyshnegradskii  who,  in  their  papers  on 
governors,^  established  and  extensively  applied  the  above-mentioned  alge- 
braic conditions  for  equations  of  a  degree  not  exceeding  three. 

In  1868  Maxwell  proposed  the  mathematical  problem  of  discovering  cor- 
responding conditions  for  algebraic  equations  of  arbitrary  degree.  Actually 
this  problem  had  already  been  solved  in  essence  by  the  French  mathematician 
Hermite  in  a  paper  [187]  published  in  1856.  In  this  paper  he  had  estab- 
lished a  close  connection  between  the  number  of  roots  of  a  complex  poly- 
nomial f{z)  in  an  arbitrary  half -plane  (and  even  inside  an  arbitrary 
triangle)  and  the  signature  of  a  certain  quadratic  form.     But  Hermite 's 


1  J.  C.  Maxwell,  'On  governors'  Proc.  Roy.  Soc.  London,  vol.  10  (1868)  ;  I.  A.  Vyshne- 
gradskii, 'On  governors  with  direct  action'  (1876).  These  papers  were  reprinted  in  the 
survey  'Theory  of  automatic  governors'  (Izd.  Akad.  Nauk  SSSR,  1949).  See  also  the 
paper  by  A.  A.  Andronov  and  I.  N.  Voznesenskii,  'On  the  work  of  J.  C.  Maxwell,  I.  A. 
Vyshnegradskii,  and  A.  Stodol  in  the  theory  of  governors  of  machines. ' 
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results  had  not  been  carried  to  a  stage  at  which  they  could  be  used  by  spe- 
cialists working  in  applied  fields  and  therefore  his  paper  did  not  receive 
due  recognition. 

In  1875  the  British  applied  mathematician  Routh  [47  ] ,  1 48  ] ,  using  Sturm 's 
theorem  and  the  theory  of  Cauchy  indices,  set  up  an  algorithm  to  determine 
the  number  k  of  roots  of  a  real  polynomial  in  the  right  half -plane  (Re  2  >  0) . 
In  the  particular  case  k  =  0  this  algorithm  then  gives  a  criterion  for  stability. 

At  the  end  of  the  19th  century,the  Austrian  research  engineer  A.  Stodola, 
the  founder  of  the  theory  of  steam  and  gas  turbines,  unaware  of  Routh 's 
paper,  again  proposed  the  problem  of  finding  conditions  under  which  all  the 
roots  of  an  algebraic  equation  have  negative  real  parts,  and  in  1895  A.  Hur- 
witz  [204]  on  the  basis  of  Hermite's  paper  gave  another  solution  (independ- 
ent of  Routh 's).  The  determinantal  inequalities  obtained  by  Hurwitz  are 
known  nowadays  as  the  inequalities  of  Routh-Hurwitz. 

However,  even  before  Hurwitz'  paper  appeared,  the  founder  of  the 
modern  theory  of  stability,  A.  M.  Lyapunov,  had  proved  in  his  celebrated 
dissertation  ('The  general  problem  of  stability  of  motion,'  Kharkov,  1892)^ 
a  theorem  which  yields  necessary  and  sufficient  conditions  for  all  the  roots 
of  the  characteristic  equation  of  a  real  matrix  J.=  11  c^ifc  p  to  have  nega- 
tive real  parts.  These  conditions  are  made  use  of  in  a  number  of  papers 
on  the  theory  of  governors.^ 

A  new  criterion  of  stability  was  set  up  in  1914  by  the  French  mathema- 
ticians Lienard  and  Chipart  [259].  Using  special  quadratic  forms,  these 
authors  obtained  a  criterion  of  stability  which  has  a  definite  advantage  over 
the  Routh-Hurwitz  criterion  (the  number  of  determinantal  inequalities  in 
the  Lienard-Chipart  criterion  is  roughly  half  of  that  in  the  Routh-Hurwitz 
criterion) . 

The  famous  Russian  mathematicians  P.  L.  Chebyshev  and  A.  A.  Markov 
have  proved  two  remarkable  theorems  on  continued-fraction  expansions  of  a 
special  type.  These  theorems,  as  will  be  shown  in  §  16,  have  an  immediate 
bearing  on  the  Routh-Hurwitz  problem. 

The  reader  will  see  that  in  the  sphere  of  problems  we  have  outlined,  the 
theory  of  quadratic  forms  (Vol.  I,  Chapter  X)  and,  in  particular,  the  theory 
of  Hankel  forms  (Vol.  I,  Chapter  X,  §  10)  forms  an  essential  tool. 


§  2.    Cauchy  Indices 

1.    We  begin  with  a  discussion  of  the  so-called  Cauchy  indices. 


2  See  [32],  §20. 

3  See,  for  example,  [102]. 
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Definition  1:  The  Cauchy  index  of  a  real  rational  function  R(x) 
between  the  limits  a  and  h  (notation:  laB(x)  ;  a  and  h  are  real  numbers  or 
±00)  is  the  difference  between  the  numbers  of  jumps  of  R(x)  from  —  00  to 
+  00  and  that  of  jumps  from  +00  to  —  00  as  the  argument  changes  from 
a  to  b.^ 

According  to  this  definition,  if 

t=l  * 

where  Ai,  a*  {i  =  l,  2,  . . . ,  p)  are  real  numbers  and  Bi{x)  is  a  rational 
function^  without  real  poles,  then^ 

ltZB(x)  =  2J^^Ai  (2) 

and,  in  general,  *"'^ 

faR{x)=2^   sgn^i     (a<b).  (2') 

a<ai<h 

In  particular,  ii  fix)  =  a^ix  —  a^)"*  •  •  •  (a;  —  aj"'»  is  a  real  polynomial 
(at  7^  ajc  for  iy^k;  i,k  =  l,  2,  . . . ,  m)  and  if  among  its  roots  ai,  a2,  . . . ,  am 
only  the  first  p  are  real,  then 

^=i.-:^=i.-^,  +  ^xW.  (2") 

where  Bi(x)  is  a  real  rational  function  without  real  poles. 
Therefore,  by  (2')  :  The  index 

ll'fQ      (a<b) 

is  equal  to  the  number  of  distinct  real  roots  of  f{x)  in  the  interval  (a,b). 

An  arbitrary  real  rational  function  B{x)  can  always  be  represented  in 
the  form 

P  f  ^(0  ^(»)      1 

t=i  (x  —  ai  {x—aiP'} 

where  all  the  a  and  A  are  real  numbers  (A^*^  ¥=^\  ^=1)  2,  . . . ,  p)  and 
JKi(x)  has  no  real  poles. 
Then 


^  In  counting  the  number  of  jumps,  the  extreme  values  of  x — the  limits  a  and  & — are 
not  included. 

®  The  poles  of  a  rational  function  are  those  values  of  the  argument  for  vv^hich  the 
function  becomes  infinite. 

■^  By  sign  a  (a  is  a  real  number)  we  mean  +1,  —  1,  or  0  according  as  a  >  0,  a  <  0, 
or  0  =  0. 
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ltZR{x)^       U      sgn^^i^  (3) 

and,  in  general,®  «»odd 


faR{x)=  Z  sgn<)   (a<6).  (3') 


2.    One  of  the  methods  of  computing  the  index /*i?(x)  is  based  on  the 
classical  theorem  of  Sturm. 

We  consider  a  sequence  of  real  polynomials 

fl{x),f2{x),   ...,fm(x)  (4) 

that  has  the  two  following  properties  with  respect  to  the  interval  (a,h)  :^ 

1.  For  every  value  a;  (a  <  x  <  &),  if  any  fkix)  vanishes,  the  two  adja- 
cent functions  fu-i{x)  and  fic+i(x)  have  values  different  from  zero  and  of 
opposite  signs;  i.e.,  for  a  <  a;  <  &  it  follows  from  fk{x)  =0  that 

U-i{x)h+i(x)  <0. 

2.  The  last  function  fm{x)  in  (4)  does  not  vanish  in  the  interval  {a,h)  ; 
i.e.,  fmix)  ^  0  for  a  <  :c  <  &. 

Such  a  sequence  (4)  of  polynomials  is  called  a  Sturm  chain  in  the  inter- 
val (a,h). 

We  denote  by  Vix)  the  number  variations  of  sign  in  (4)  for  a  fixed 
value  ic.^°  Then  the  value  of  V{x),  as  x  varies  from  a  to  h,  can  only  change 
when  one  of  the  functions  in  (4)  passes  through  zero.  But  by  1.,  when  the 
functions  fkix)  {k  =  2,  .  .  . ,  m  —  1)  pass  through  zero,  the  value  of  V(x) 
does  not  change.  When  fiix)  passes  through  zero,  then  one  variation  of 
sign  in  (4)  is  lost  or  gained  according  as  the  ratio  f2(x)/fi{x)  goes  from 
—  00  to  +  00  or  vice  versa.    Hence  we  have : 

Theorem  1  (Sturm)  :  If  fi{x),  f2(x),  . . . ,  fm{x)  is  a  Sturm  chain  in 
(a,  &)  and  Vix)  is  the  number  of  variations  of  sign  in  the  chain,  then 

iJ/^  =  F(o)-F(6).  (5) 


8  In  (3)  the  sum  is  extended  over  all  the  values  i  for  which  the  corresponding  Ui  is  odd. 
In  (3')  the  sum  is  extended  over  all  the  i  for  which  Ui  is  odd  and  a  <i  at  <ii. 

9  Here  a  may  be  —  oo  and  h  may  be  -f  oo  . 

^°  If  a  <  a;  <  &  and  fi.(x)  ^  0,  then  by  1.  in  the  determination  of  V {x)  a  zero  value 
in  (4)  may  be  omitted  or  an  arbitrary  sign  may  be  attributed  to  this  value.  If  a  is  finite, 
then  V{a)  must  be  interpreted  as  y{a-\-E),  where  e  is  a  positive  number  sufficiently 
small  that  in  the  half -closed  interval  {a,a-\-e\  none  of  the  functions  /i(x)  vanishes. 
In  exactly  the  same  way,  if  &  is  finite,  F(b)  is  to  be  interpreted  as  F(b  —  e),  where  the 
number  e  is  defined  similarly. 


176    XV.  The  Problem  of  Routh-Hurwitz  and  Related  Questions 

Note.  Let  us  multiply  all  the  terms  of  a  Sturm  chain  by  one  and  the 
same  arbitrary  polynomial  d{x).  The  chain  of  polynomials  so  obtained  is 
called  a  generalized  Sturm  chain.  Since  the  multiplication  of  all  the  terms 
of  (4)  by  one  and  the  same  polynomial  alters  neither  the  left-hand  nor  the 
right-hand  side  of  (5),  Sturm's  theorem  remains  valid  for  generalized 
Sturm  chains. 

Note  that  if  f(x)  and  g(x)  are  any  two  polynomials  (where  the  degree 
ol  f{x)  is  not  less  than  that  of  g(x)),  then  we  can  always  construct  a  gen- 
eralized Sturm  chain  (4)  beginning  with  fi{x)  ^  f(x),  f2(x)  ^  g(x)  by 
means  of  the  Euclidean  algorithm. 

For  if  we  denote  by  — fsi^)  the  remainder  on  dividing  fi(x)  hy  f2{x), 
by  — f^ix)  the  remainder  on  dividing  f2{x)  by  fsix),  etc.,  then  we  have 
the  chain  of  identities 

fiix)  =  Qiix)  f2{z)  —  fsix) , 

fk-ii^)=qk-iix)fkix)  —  fk+i(^)>  (6) 

where  the  last  remainder  fm{x)  that  is  not  identically  zero  is  the  greatest 
common  divisor  of  f{x)  and  g{x)  and  also  of  all  the  functions  of  the 
sequence  (4)  so  constructed,  li  fm{x)  y^O  {a  <i  x  <i  h)  then  this  sequence 
(4)  satisfies  the  conditions  1.,  2.  by  (6)  and  is  a  Sturm  chain.  If  the 
polynomial  fm{x)  has  roots  in  the  interval  (a,!)),  then  (4)  is  a  generalized 
Sturm  chain,  because  it  becomes  a  Sturm  chain  when  all  the  terms  are 
divided  hy  fm(x). 

From  what  we  have  shown  it  follows  that  the  index  of  every  rational 
function  R(x)  can  be  determined  by  Sturm's  theorem.     For  this  purpose 

it  is  sufficient  to  represent  E(ic)  in  the  form  Q(x)  +  tt-:,  where  Q(x),  f(x), 

g{x)  are  polynomials  and  the  degree  of  g{x)  does  not  exceed  that  of  f{x). 
If  we  then  construct  the  generalized  Sturm  chain  for  f{x),  g{x),  we  have 

By  means  of  Sturm's  theorem  we  can  determine  the  number  of  distinct 
real  roots  of  a  polynomial  fix)  in  the  interval  {a,h),  since  this  number,  as 

we  have  seen,  is  /*  ~^} . 
«/(a:) 
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§  3.     Routh's  Algorithm 

1.    Routh's  problem  consists  in  determining  the  number  k  of  roots  of  a 

real  polynomial  f{x)  in  the  right  half -plane  (Re  0  >  0). 

To  begin  with,  we  treat  the  case  where  f{z)  has  no  roots  on  the  imaginary 

axis.  In  the  right  half -plane  we  construct  the  semicircle  of  radius  R  with 
its  center  at  the  origin  and  we  consider  the  domain 
bounded  by  this  semicircle  and  the  segment  of  the  imagi- 
nary axis  (Fig.  7) .  For  sufficiently  large  R  all  the  zeros 
of  f(z)  with  positive  real  parts  lie  inside  this  domain. 
Therefore  a,vgf{z)  increases  by  2k7i  on  going  in  the 
positive  direction  along  the  contour  of  the  domain. ^^  On 
the  other  hand,  the  increase  of  arg/(2;)  along  the  semi- 
circle of  radius  R  for  i?->  00  is  determined  by  the  in- 
crease of  the  argument  of  the  highest  term  aoZ""  and  is 
therefore  nn.  Hence  the  increase  of  arg/(;2)  along  the 
Fig.  7  imaginary  axis  (R^>  co)  is  given  by  the  expression 


A tZ  arg /  (iw)  =  {n—2k)7t. 


(7) 


We  introduce  a  somewhat  unusual  notation  for  the  coefficients  oi  f{z) 
namely,  we  set 


Then 

where  for  even  n 


f  (z)  =  aoz"  +  6o2"~'  +  «i2"-^  +  &i2"-'  +  •  •  •      («o  v^ 0)- 
t{i(x))  =  U{oy)  +  iV{oj), 


C7  (co)  =  (—  1 )  2  (aoco«  —  aia;«-2  +  a.^af"-^ ) ,        ( 

F  ( w)  =  (—  1 )  2        (6oco"-i  —  \oj''-^  +  62Co"-5 )  j 


(8) 


(8') 


and  for  odd  n 


V{(x))=^{—1)   2    (6ow"-i  — 6iCo"-=^  +  62Co"-5 

n— 1 

F(co)=:(— 1)    2    (aoco"  — aiw"-2  4-a2a;^-* ). 


(8") 


11  For  if  f{z)  —  a^  ![  (z  —  Zi),  then  A  argfiz)  =1  A  arg  (z  —  Zi).    If  the  point 
t  =  1  *  =  1 

Zi  lies  inside  the  domain  in  question,  then  A  arg  {z  —  zi)  ^2k;   if  Zi  lies  outside  the 

domain,  then  A  arg  (z  —  Zi)  =0. 
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Following  Routh,  we  make  use  of  the  Cauchy  index.    Then^^ 


—  AlZa.TgfiiM) 


oo  U{(o)       .  ,.      U{(o)      _ 

-oo  -ttt-  X       for  lim  -fTJ—^  =  0 , 

,+  ooF(a>)„  ,.      V{co)       _  ^^^ 

7_oo  777-;  for  hm  y^  =0. 


The  equations  (8')  and  (8'')  show  that  for  even  n  the  lower  formula  in 
(9)  must  be  taken  and  for  odd  n,  the  upper.  Then  we  easily  obtain  from 
(7),  (8'),  (8''),  and  (9)  that  for  every  n  (even  or  odd)^^ 

^oo&oa>^-i-6,con-3  +  ...^  (10) 

2.    In  order  to  determine  the  index  on  the  left-hand  side  of  (10)  we  use 
Sturm's  theorem  (see  the  preceding  section).    We  set 

/i  (w)  =  aoco"  —  aiC0«-2  +  •  • .,        f^  (co)  =  6oco"-i  —  6ia;«-=^  +  ••  •      (11) 

and,  following  Routh,  construct  a  generalized  Sturm  chain  (see  p.   176) 

/iMJaMJsM.  ..., /^H.  (12) 

by  the  Euclidean  algorithm. 

First  we  consider  the  regular  case :  m  =  n  +  1.  In  this  case  the  degree 
of  each  function  in  (12)  is  one  less  than  that  of  the  preceding,  and  the  last 
function  fm{co)  is  of  degree  zero.^* 

From  Euclid's  algorithm  (see  (6) )  it  follows  that 

U  W^^co /a  (w)  - /i  (w)  =  CoW^-2 -  ciw«-*  +  Cgw"-^ , 

where 

Co-cti-^^h- ^^ ,     Ci-a2-^&2- ^^ ,  ..-.  (13) 


Similarly 


where 


The  coefficients  of  the  remaining  polynomials  f^{(o),  . . . ,  /n+i(co)  are  simi- 
larly determined. 


12  Since  arg  /(ia>)  ;=  arccot  =t^   =  arctan  ^r^  . 

^3  We  recall  that  the  formula  (10)  was  derived  under  the  assumption  that  f(e)  has  no 
roots  on  the  imaginary  axis. 

^*  In  the  regular  case  (12)  is  the  ordinary  (not  generalized)  Sturm  chain. 
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Each  polynomial 


/iH,  /2(w),  ...,  fn+lM 


(U) 


is  an  even  or  an  odd  function  and  two  adjacent  polynomials  always  have 
opposite  parity. 


We  form  the  Routh  scheme 


h, 


dn 


d>i 


(15) 


The  formulas  (13),  (13')  show  that  every  row  in  this  scheme  is  deter- 
mined by  the  two  preceding  rows  according  to  the  following  rule : 

From  the  numbers  of  the  upper  row  we  subtract  the  corresponding  num- 
bers of  the  lower  row  multiplied  by  the  number  that  makes  the  first  differ- 
ence zero.    Omitting  this  zero  difference,  we  obtain  the  required  row. 

The  regular  case  is  obviously  characterized  by  the  fact  that  the  repeated 
application  of  this  rule  never  yields  a  zero  in  the  sequence 

^0'   ^0'  ^0'   •  •  •  • 

Figs.  8  and  9  show  the  skeleton  of  Routh's  scheme  for  an  even  n  ( w  =  6) 
and  an  odd  n  (n^l).  Here  the  elements  of  the  scheme  are  indicated  by 
dots. 

In  the  regular  case,  the  polynomials  fi{co)  and  /2(co)  have  the  greatest 
common  divisor  /„+i(co)  =  const.  ^^  0.  Therefore  these  polynomials,  and 
hence  U{o))  and  V{co)  (see  (8'),  (8'')?  and  (11)),  do  not  vanish  simul- 
taneously; i.e.,  fiico)  =TJ{m)  +  i'V{co)  ^0  for  real  co.  Therefore :  In  the 
regular  case  the  formula  (10)  holds. 

When  we  apply  Sturm's  theorem 
in  the  interval  ( —  oo ,  +  oo  )  to  the  left- 
hand  side  of  this  formula  and  make 
use  of  (14),  we  obtain  by  (10)  : 
F(-oo)— F(+  oo)  =  n-'2k.  (16) 
In  our  case^® 


V{+oo)=V{aQ,  bQ,  CQ,dQ,  .  . .) 


Fig.  8 


Fig.  9 


and 


15  The  sign  of  fk(co)  for  a>  =  -f  oo  coincides  with  the  sign  of  the  highest  coefficient 
and  for  co  = —  oo  differs  from  it  by  the  factor  ( — i)n-k  +  i  (fc  =z  1,  2,  . . . ,  n  -f  1). 
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V  {—  co)=  V  {aQ,  —  bQ,Co,  —  dQ,  ...)• 
Hence 

V{—oo)  =  n—V{+oo),  (17) 

From  (16)  and  (17)  we  find: 

k  =  V  {ttQ,  Iq,  Cq,  dQ,  ...).  (18) 

Thus  we  have  proved  the  following  theorem : 

Theorem  2  (Routh)  :  The  number  of  roots  of  the  real  polynomial  f{z) 
in  the  right  half -plane  Re  2:  >  0  is  equal  to  the  number  of  variations  of  sign 
in  the  first  column  of  Routh' s  scheme. 

3.  We  consider  the  important  special  case  where  all  the  roots  of  f{z)  have 
negative  real  parts  ('case  of  stability').  If  in  this  case  we  construct  for  the 
polynomials  (11)  the  generalized  Sturm  chain  (14),  then,  since  k  =  0,  the 
formula  (16)  can  be  written  as  follows: 

V{—oo)—V(+  00)  =n.  (19) 

But  0^  V{—  00)  ^m  —  l^n  and  0^  V(+  00)  ^m  —  l^n.  There- 
fore (19)  is  possible  only  when  m=^n  +  1  (regular  case !)  and  F(+  00  )  =0, 
V( —  00 )  =  m  —  l  =  n.    The  formula  (18)  then  implies  : 

Routh's  Criterion.  All  the  roots  of  the  real  polynomial  f{z)  have 
negative  real  parts  if  and  only  if  in  the  carrying  out  of  Routh's  algorithm 
all  the  elements  of  the  first  column  of  Routh's  scheme  are  different  from 
zero  and  of  like  sign. 

4.  In  deriving  Routh's  theorem  we  have  made  use  of  the  formula  (10). 
In  what  follows  we  shall  have  to  generalize  this  formula.  The  formula  (10) 
was  deduced  under  the  assumption  that  f{z)  has  no  roots  on  the  imaginary 
axis.  We  shall  now  show  that  in  the  general  case,  where  the  polynomial 
f{z)  =  a^""  4-  6o2;"-i  +  a^  z""-"^  ^ (ao  =7^  0)  has  A;  roots  in  the  right  half- 
plane  and  s  roots  on  the  imaginary  axis,  the  formula  (10)  is  replaced  by 

^+00  6oa>n-l  -  6,a;n-3  +  6,con-^5 =n^2k-S,  (20) 

For 

f{z)=d{z)f*{z), 

where  the  real  polynomial  d{z)  =^  z^  ■\- . . .  has  s  roots  on  the  imaginary 
axis  and  the  polynomial  f*{z)  of  degree  n*  =  n  —  s  has  no  such  roots. 
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For  the  sake  of  definiteness,  we  consider  the  case  where  s  is  even  (the 
case  where  s  is  odd  is  analyzed  similarly). 
Let 

/  {ia))=  U  M  +  iV  {a))=d(iaj)  [U*  (co)  +  iV*  (w)]. 

Since  in  our  case  d(ico)  is  a  real  polynomial  in  co,  we  have 

Ujco)  _U*{cd) 
F(o>)  ~~  V*{(o)' 

Since  n  and  n*  have  equal  parity,  we  find  by  using  (8'),  (8''),  and  the  nota- 
tion (11)  : 

4(tt>)_/LH 

/i(«>)      nico)' 

We  apply  formula  (10)  to /*(  2!).    Therefore 

/i  {(o)  ^  fl  {Oi) 

and  this  is  what  we  had  to  prove. 


§  4.    The  Singular  Case.     Examples 

1.  In  the  preceding  section  we  have  examined  the  regular  case  where  in 
Routh's  scheme  none  of  the  numbers  &o,  Co,  do,  .  .  .  vanish. 

We  now  proceed  to  deal  with  the  singular  cases,  where  among  the  num- 
bers &o,  Co,  . . .  there  occurs  a  zero,  say,  Kq  =  0.  Routh's  algorithm  stops  with 
the  row  in  which  ho  occurs,  because  to  obtain  the  numbers  of  the  following 
row  we  would  have  to  divide  by  ho- 

The  singular  cases  can  be  of  two  types : 

1)  In  the  row  in  which  ho  occurs  there  are  numbers  different  from  zero. 
This  means  that  at  some  place  of  (12)  the  degree  drops  by  more  than  one. 

2)  All  the  numbers  of  the  row  in  which  ho  occurs  vanish  simultaneously. 
Then  this  row  is  the  (m+  l)-th,  where  m  is  the  number  of  terms  in  the 
generalized  Sturm  chain  (12).  In  that  case,  the  degrees  of  the  functions  in 
(12)  decrease  by  unity  from  one  function  to  the  next,  but  the  degree  of 
the  last  function  fm{o))  is  greater  than  zero.  In  both  cases  the  number  of 
functions  in  (12)  is  m  <  n  +  1. 

Since  the  ordinary  Routh^s  algorithm  comes  to  an  end  in  both  cases, 
Routh  gives  a  special  rule  for  continuing  the  scheme  in  the  cases  1),  2) . 
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2.  In  case  1),  according  to  Routh,  we  have  to  substitute  for  /t^  =  0  a  ' small' 
value  £  of  definite  (but  arbitrary)  sign  and  continue  to  fill  in  the  scheme. 
Then  the  subsequent  elements  of  the  first  column  of  the  scheme  are  rational 
functions  of  e.  The  signs  of  these  elements  are  determined  by  the  'small- 
ness'  and  the  sign  of  e.  If  any  one  of  these  elements  vanishes  identically  in  e, 
then  we  replace  this  element  by  another  small  value  rj  and  continue  the 
algorithm. 
Example : 

f(z)=z*  +  z^  +  2z'  +  2z  +  l.  : 

Routh' s  scheme  (with  a  small  parameter  e)  : 

1,        2,    1 

1.        2 

e,         1         A;=F(l,l,e,2— i,l)=2. 

2-1 
e 

1 

This  special  method  of  varying  the  elements  of  the  scheme  is  based  on 
the  following  observation : 

Since  we  assume  that  there  is  no  singularity  of  the  second  type,  the 
functions  fi{co)  and  f2(o))  are  relatively  prime.  Hence  it  follows  that  the 
polynomial  f{z)  has  no  roots  on  the  imaginary  axis. 

In  Routh 's  scheme  all  the  elements  are  expressed  rationally  in  terms  of 
the  elements  of  the  first  two  rows,  i.e.,  the  coefficients  of  the  given  poly- 
nomial. But  it  is  not  difficult  to  observe  in  the  formulas  (13),  (13')  and 
the  analogous  formulas  for  the  subsequent  rows  that,  once  we  have  given 
arbitrary  values  to  the  elements  of  any  two  adjacent  rows  of  Routh 's  scheme 
and  to  the  first  element  of  the  preceding  row,  we  can  express  all  the  elements 
in  the  first  two  rows,  i.e.,  the  coefficients  of  the  original  polynomial,  in 
integral  rational  form  in  terms  of  these  elements.  Thus,  for  example,  all 
the  numbers  a,  h  can  be  represented  as  integral  rational  functions  of 

OfQ,  bo,  Co,  -  ■  ■ ,  Qo,  Qi,  92,  -  -  -  ,  ho,  hi,  h^,  .  .  . 

Therefore,  in  replacing  ^o  =  0  by  a  we  in  fact  modify  our  original  poly- 
nomial. Instead  of  the  scheme  for  f{z)  we  have  the  Routh  scheme  for  a 
polynomial  F{z,  e),  where  F{z,s)  is  an  integral  rational  function  of  z  and  e 
which  reduces  to  fiz)  for  £  =  0.  Since  the  roots  of  i^(2!,  £)  change  continu- 
ously with  a  change  of  the  parameter  £  and  since  there  are  no  roots  on  the 
imaginary  axis  for  £  =  0,  the  number  k  of  roots  in  the  right  half -plane  is  the 
same  for  F{z,  e)  and    F{z,0)  =f{z)  for  values  of  e  of  small  modulus. 
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3..  Let  us  now  proceed  to  a  singularity  of  the  second  type.  Suppose  that 
in  Routh's  scheme 

aoy^0,'bo7^0,  .  ..  ,  eo¥=0,go  =  0,  gi  =  0,  02  =  0,  .  .  . 

In  this  case,  the  last  polynomial  in  the  generalized  Sturm  chain  (16)  is  of 
the  form : 

fm  (<w)  =  eoco»— "»+ 1  —  CjO)"— "»— 1  +  •  •  • . 

Routh  proposes  to  replace  fm+i(co),  which  is  zero,  by  fm(co)  ;  i.e.,  he 
proposes  to  write  instead  of  go,  gi,  ...  the  corresponding  coefficients 

{n  —  w  +  1)  Co,     (n  —  m  —  1)  Cj,  ... 

and  to  continue  the  algorithm. 

The  logical  basis  for  this  rule  is  as  follows : 
By  formula  (20) 

/i(tt>) 

(the  s  roots  ot  f{z)  on  the  imaginary  axis  coincide  with  the  real  roots  of 
fmico))'    Therefore,  if  these  real  roots  are  simple,  then  (see  p.  174) 

and  therefore 

This  formula  shows  that  the  missing  part  of  Routh's  scheme  must  be  filled 
by  the  Routh  scheme  for  the  polynomials  fmico)  and  fm(co).  The  coeffi- 
cients of  fm(co)  are  used  to  replace  the  elements  of  the  zero  row  in  Routh's 
scheme. 

But  if  the  roots  of  fmico)  are  not  simple,  then  we  denote  by  dico)  the 
greatest  common  divisor  oi  fmico)  and  fmico),hy  eico)  the  greatest  common 
divisor  of  dico)  and  d^ico),  etc.,  and  we  have: 

T+oo_CM  +ood^(ft))  +ooe>))  ^ 

^^  fmico)  +  ^—  d{ay)  +  ^—  eico)  '^  "  '         ' 

Thus  the  required  number  k  can  be  found  if  the  missing  part  of  Routh's 
scheme  is  filled  by  the  Routh  scheme  for  fmico)  and  fmico),  then  the  scheme 
for  dico)  and  d\co),  then  that  for  e(co)  and  e'ico),  etc.,  i.e.,  Routh's  rule 
has  to  be  applied  several  times  to  dispose  of  a  singularity  of  the  second  type. 
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Example,  f  (z)  =z^^  +  z^ ^ z^  —  2 z'  +  z'^  -j-  3 z^  +  z^  —  2 z^  —  z^  -{-  z  +  l . 
Scheme 


ft)l« 

1 

—  1 

1 

1 

— 1 

ft)9 

1 

—  2 

3 

—2 

1 

«)« 

1 

2 

3 

—2 

1 

-1 

8 

2 

—12 
—  3 

12 
3 

—4 
— 1 

co«      - 

-1 

3 

—  3 

2 

0)5  1 

3 
1 

—  3 

—  1 

3 
1 

-1 

2 
1 

—  2 

—  1 

2 
1 

..{ 

4 

2 

—  2 

—  1 

0)2        - 

— 1 

2 

ft) 

1 
2 

k=V{h  1, 1,  2,  —1, 1, 1,  2,  —1, 1, 1)  ==  4 


Note.  All  the  elements  of  any  one  row  may  be  multiplied  by  one  and 
the  same  number  without  changing  the  signs  of  the  elements  of  the  first 
column.    This  remark  has  been  used  in  constructing  the  scheme. 

4.  However,  the  application  of  both  rules  of  Routh  does  not  enable  us  to 
determine  the  number  k  in  all  the  cases.  The  application  of  the  first  rule 
(introduction  of  small  parameters  e,  . .  .)  is  justified  only  when  f{z)  has 
no  roots  on  the  imaginary  axis. 

li.  f{z)  has  roots  on  the  imaginary  axis,  then  by  varying  the  parameter  e 
some  of  these  roots  may  pass  over  into  the  right  half -plane  and  change  k. 

Example.      f{z)=z^  +  z^  +  Sz^  +  Sz^  +  3z^  +  2z-hl. 


Scheme 

ft>6 

1 

3 

3 

ft>^ 

1 

3 

2 

CO* 

e 

1 

1 

ft>3 

3-1 

e 

2 

1 
e 

ft)2             J- 

2e  — 1 

3-1 

e 

1 

0) 

u 

co^ 

1 

I  3-1 

e       ,       2e — 1 


F(l,l..,3-l.l,_e.l)={* 


for     e  >  0, 
for     e  <  0. 
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The  question  of  the  value  of  k  remains  open. 

In  the  general  case,  where  f{z)  has  roots  on  the  imaginary  axis,  we  have 
to  proceed  as  follows : 

Setting  f{z)  =Fi(z)  +F2{z),  where 

Fi{z)  ^  aoz""  +  a.i2^-2  +  .  .  . ,         Fziz)  =  hoZ''-''  +  Z^i^"-^  +  .  .  . , 

we  must  find  the  greatest  common   divisor  ^(0)  of  2^1(2)  and  i^2 (2;).    Then 
f{z)=d{z)f*{z). 

1/  f{z)  has  a  root  z  for  which  — ^  is  also  a  root  (all  the  roots  on  the 
imaginary  axis  have  this  property),  then  it  follows  from  /(2)=0  and 
f{—z)  =Othat  2^1(2)  =0andi^2(^)  =0,  i.e.,  0  is  a  root  of  cZ(^).  Therefore 
f*{z)  has  no  roots  z  for  which  —  2;  is  also  a  root  oi  f*{z). 

Then 

A;  =  A^i  +  ^2, 

where  ki  and  k2  are  the  respective  numbers  of  roots  of  /*(2!)  and  d{z)  in  the 
right  half -plane ;  fci  is  determined  by  Routh's  algorithm  and  /c2  =  {q  —  5)  /2, 
where  q  is  the  degree  oid{z)  and  s  the  number  of  real  roots  oid{ia>)}^ 
In  the  last  example, 

d{z)=z''  +  l,    f''iz)=z^  +  z'  +  2z'  +  2z  +  l. 

Therefore  (see  example  on  p.  182),  we  have  k2  =  0,  ki  =  2,  and  hence 

k  =  2. 


§  5.    Lyapunov's  Theorem 

1.  From  the  investigations  of  A.  M.  Lyapunov  published  in  1892  in  his 
monograph  'The  General  Problem  of  Stability  of  Motion'  there  follows  a 
theorem^^  that  gives  necessary  and  sufficient  conditions  for  all  the  roots  of 
the  characteristic  equation  \  XE  —  A  |  =0  of  a  real  matrix  A=  11  a<fc  ||i  to 
have  negative  real  parts.     Since  every  polynomial 

f{k)  =  aoX""  +  aiA"-i  +  ...-^On         (ao  v^  0) 


1®  d  {i  u>)  is  a  real  polynomial  or  becomes  one  after  cancelling  i.    The  number  of  its  real 
roots  can  be  determined  by  Sturm 's  theorem. 
17  See  [32],  §20. 
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can  be  represented  as  a  characteristic  determinant  \  XE  —  A\,^^  Lyapunov's 
theorem  is  of  general  character  and  is  applicable  to  an  arbitrary  algebraic 
equation  f(X)  =0. 

Suppose  given  a  real  matrix  A  =  11  aijc  11 1  and  a  homogeneous  polynomial 
of  dimension  m  in  the  variables  xi,  X2,  . . . ,  Xni 

V{x,x,...,x)         {x  =ixi,  arg,  .  . . ,  xj) . 


Let  us  find  the  total  derivative  with  respect  to  ^of  V{x,  x, 
the  assumption  that  x  is  a  solution  of  the  differential  system 


Then 


dx        . 


,  x)  under 


^V{x,x,...,x)  =  V {Ax,  X,  ...,x) 

+  V(x,Ax,...,x)  +  "-  +  V{x,x,...,Ax) 
=  W{x,  X,  . . .,  x), 


(21) 


where  W{x,  x,  .  .  . ,  x)  is  again  a  homogeneous  polynomial  of  dimension  m 
in  Xi,  X2, . . . ,  Xn.  The  equation  (21)  defines  a  linear  operator  A  which  asso- 
ciates with  every  homogeneous  polynomial  of  dimension  m  V(x,  x,  . . . ,  x) 
a  certain  homogeneous  polynomial  W{x,  x,  . . . ,  x)  of  the  same  dimension  m 

W=A{V). 

We  restrict  ourselves  to  the  case  m  =  2.^^    Then  V{x,x)  and  W(x,x) 
are  quadratic  forms  in  the  variables  Xi,  X2, . . . ,  Xn  connected  by  the  equation 


;^  V{x,  x)=V{Ax,  x)  +  V(x,  Ax)=W(x,  x)-. 


dt 


hence^^ 


W  =  A{V)=A''V+VA. 


^8  For  this  purpose  it  is  sufficient  to  set,  for  example: 


A  = 


^^  A.  M.  Lyapunov  has  proved  his  theorem  for  every  positive  integer  m. 
20  Because  V(x,y)  =  x^Vy. 


0 

0  . 

.  0 

1 

0  . 

.  0 

0 

0  . 

.  1 

(22) 
(23) 
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Here  V=  11  Vik  11 1  and  ^=11  f^ik  11 1  are  symmetric  matrices  formed, 
respectively,  from  the  coefficients  of  the  forms  V(x,x)  and  W{x,x).  The 
linear  operator  A  in  the  space  of  matrices  of  order  n  is  completely  deter- 
mined by  specification  of  the  matrix  A=  ||  aik  Iji . 

If  Ai,  A2,  .  .  .  ,  An  are  the  characteristic  values  of  the  matrix  A,  then  every 
characteristic  value  of  the  operator  A  can  be  represented  in  the  form 
Ai  +  Afc  (1^^,  k^n).'' 

Therefore,  if  the  matrix  A=^  11  Oi^  11 1  has  no  zero  characteristic  value  and 
no  two  that  are  opposites,  then  the  operator  A  is  non-singular.  In  this  case 
the  matrix  TT  in  (23)  determines  the  matrix  V  uniquely. 

If  V  is  symmetric,  then  the  matrix  W  defined  by  (23)  is  also  symmetric. 
If  ^  is  a  non-singular  operator,  then  the  converse  statement  also  holds: 
Every  symmetric  matrix  W  corresponds  by  (23)  to  a  symmetric  matrix  7. 
For  in  this  case  we  find,  by  going  over  to  the  transposed  matrices  on  both 
sides  of  (23) ,  that  the  matrix  V",  as  well  as  V,  satisfies  (23) .  By  the  unique- 
ness of  the  solution,  V^  =  V. 

Thus :  If  the  matrix  A  =  11  Oijc  11 1  has  no  zero  and  no  two  opposite  char- 
acteristic values,  then  every  quadratic  form  W{x,x)  corresponds  to  one  amd 
only  one  quadratic  form  V{x,x)  connected  with  W{x,x)'by  (22). 

Now  we  can  formulate  Lyapunov's  theorem. 

Theorem  3  (Lyapunov)  :  If  all  the  characteristic  values  of  the  real 
matrix  A  =  11  Oi^  p  have  negative  real  parts,  then  to  every  negative-definite 
quadratic  form  W{x,x)  there  corresponds  a  positive-definite  quadratic  form 
y{x,x)  connected  with  W {x,  x) — taking 

dx 

j-  =  Ax  (24) 

i7hto  account — hy  the  equation 

^V{x,x)  =  W{x,x).  (25) 

Conversely,  if  for  every  negative-definite  form  W(x,x)  there  exists  a 
positive-definite  form  V{x,x)  connected  with  W{x,x)  hy  the  equation  (25) 
— taking  (24)  into  account — then  all  the  characteristic  values  of  the  matrix 
A.=  II  aik  II 1  have  negative  real  parts. 

Proof.  1.  Suppose  that  all  the  characteristic  values  of  A  have  negative 
real  parts.    Then  for  every  solution  x  =  e^^iCo  of  (24)  we  have  lim  x  =  0.^^ 

Suppose  that  the  forms  Vix,x)  and  W{x,x)  are  connected  by  (25)  and  that 


21  See  footnote  18. 
^  22  See  Vol.  I,  Chapter  V,  §  6. 
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W{x,x)    <0   (Xy^O).^^ 

Let  us  assume  that  for  some  Xoj^o 

Vq  =  V{xo,Xq)^0. 
But  j^  V{x,  x)  =  W{x,  x)  <0  {x=  e^^  x^).    Therefore  for  #  >  0  the  value 

oiViXfX)  is  negative  and  decreases  for  t-^  oo,  which  results  in  a  contradic- 
tion to  the  equation  lim  V{x,x)  =  lim  Vix,x)  =0.    Therefore  V(x,x)  >  0 

for  X  7^  0,  i.e.,  V{x,  x)  is  a  positive-definite  quadratic  form. 
2.    Suppose,  conversely,  that  in  (25) 

W{x,x)  <0,     V{x,x)>0         (xy^o). 

From  (25)  it  follows  that 

t 
V  {x,  x)  =  V  {xq,  Xq)+  JW  {x,  x)  dt        {x  =  e^'xQ) .  (25') 

0 

We  shall  show  that  for  every  Xq-t^  o  the  column  x  =  e'^^Xo  comes  arbitrarily 
near  to  zero  for  arbitrarily  large  values  of  #  >  0.  Assume  the  contrary. 
Then  there  exists  a  number  v  >  0  such  that 

W{x,x)<  —  v<0         {x=e^^XQ,    Xq^o,    t>0). 

But  then  from  (25') 

V [x,  x)<V {xq,  Xq)  —  vt , 

and  so  for  sufficiently  large  values  of  t  we  have  V{XyX)  <  0,  which  contra- 
dicts our  assumption. 

From  what  we  have  shown,  it  follows  that  for  certain  sufficiently  large 
values  of  t  the  value  oiY{x,x)  (ic  =  e^^Xo,  Xo  ^^  0)  will  be  arbitrarily  near 

to  zero.    But  YiXfX)  decreases  monotonically  for  #  >  0,  since -^  Y{x,x)  = 

W{x,x)  <  0.    Therefore  lim  F (a;,  a;)  =  0. 

Hence  it  follows  that  for  every  Xq^q,  lime^*rco  =  o,  i.e.,  lime^'  =  0. 

This  is  only  possible  if  all  the  characteristic  values  of  A  have  negative  real 
parts  (see  Vol.  I,  Chapter  V,  §  6). 

The  theorem  is  now  completely  proved. 

For  the  form  W^x^x)  in  Lyapunov's  theorem  we  can  take  any  negative- 

n 

definite  form,  in  particular,  the  form  — ^  x?.     In  this  case  the  theorem 

1=1 
admits  of  the  following  matrix  formulation : 


23  The  form  W(x,x)  is  given  arbitrarily.  The  form  V(x,x)  is  uniquely  determined 
by  (25),  because  A  has  in  this  case  neither  the  characteristic  value  zero  nor  pairs  of 
opposite  characteristic  values. 
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Theorem  3' :    All  the  characteristic  values  of  the  real  matrix  A  =  Ij  anc  11 1 
have  negative  real  parts  if  and  only  if  the  matrix  equation 

A'V  +VA  =  —  E  (20) 

has  as  its  solution  V  the  coefficient  matrix  of  some  positive-definite  quad- 
ratic form  V{x,x)  >  0. 

2.    From  this  theorem  we  derive  a  criterion  for  determining  the  stability  of 
a  non-linear  system  from  its  linear  approximation. ^^ 

Suppose  that  it  is  required  to  prove  the  asymptotic  stability  of  the  zero 
solution  of  the  non-linear  system  of  differential  equations  (1)  (p.  172)  in 
the  case  where  the  coefficients  Oik  {i,k  =  l,  2,  .  .  . ,  n)  in  the  linear  terms 
on  the  right-hand  side  form  a  matrix  A=  11  aijc  11 1  having  only  characteristic 
values  with  negative  real  parts.  Then,  if  we  determine  a  positive-definite 
form  V(x,x)  by  the  matrix  equation  (26)  and  calculate  its  total  derivative 
with  respect  to  time  under  the  assumption  that  ic—  (iCi,  X2,  .  .  . ,  Xn)  is  a 
solution  of  the  given  system  ( 1 ) ,  we  have : 


dt 


V{X,  X)=  —  2J^I-^R{^1,  ^2,  ••',  ^J 


where  B{xx,  X2,  . . . ,  Xn)  is  a  series  containing  terms  of  the  third  and  higher 
total  degree  in  x^,  X2,  . .  .  y  a:„.  Therefore,  in  some  sufficiently  small  neigh- 
borhood of  (0,  0,  . . . ,  0)  we  have  simultaneously  for  every  Xy^o 

V{x,x)>0,     ^^V{x,x)<0. 

By  Lyapunov^s  general  criterion  of  stability^^  this  also  indicates  the 
asymptotic  stability  of  the  zero  solution  of  the  system  of  differential  equa- 
tions. 

If  we  express  the  elements  of  Y  from  the  matrix  equation  (26)  in  terms 
of  the  elements  of  A  and  substitute  these  expressions  in  the  inequalities 


^11  >0 


^21       ^22 


>0 


^11       ^12 
"^21       ^22 


'nl 


'n2 


'±n 


>o, 


then  we  obtain  the  inequalities  that  the  elements  of  a  matrix  A=  1]  aiu  11? 
must  satisfy  in  order  that  all  the  characteristic  values  of  the  matrix  should 


24  See  [32] ,  §  26 ;  [9] ,  pp.  113  f f . ;  [36] ,  pp.  66  f f . 

25  See  [32],  §  16;   [9],  pp.  19-21  and  31-33;   [36],  pp.  32-34. 
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have  negative  real  parts.  However,  these  inequalities  can  be  obtained  in  a 
considerably  simpler  form  from  the  criterion  of  Routh-Hurwitz,  which  will 
be  discussed  in  the  following  section. 

Note.  Lyapunov's  theorem  (3)  or  (3')  can  be  generalized  immediately 
to  the  case  of  an  arbitrary  complex  matrix  A=  11  aik  11 1.  The  quadratic 
forms  YiXyX)  and  W{x,x)  are  then  replaced  by  Hermitian  forms 

n  n 

V  {X,  X)=2J  ^ii^i^k  >       W{X,X)=^  ^ilP^iXjc  - 
i,*=l  t,A:=l 

Correspondingly,  the  matrix  equation  (26)  is  replaced  by  the  equation 


§  6.    The  Theorem  of  Routh-Hurwitz 

1.  In  the  preceding  sections  we  have  explained  the  method  of  Routh,  un- 
surpassed in  its  simplicity,  of  determining  the  number  h  of  roots  in  the  right 
half-plane  of  a  real  polynomial  whose  coefficients  are  given  as  explicit 
numbers.  If  the  coefficients  of  the  polynomial  depend  on  parameters  and 
it  is  required  to  determine  for  what  values  of  the  parameters  the  number  k 
has  one  value  or  another — in  particular,  the  value  0  ('domain  of  stability')^® 
— then  it  is  desirable  to  have  explicit  expressions  for  the  values  of  Co,  do,  . . . 
in  terms  of  the  coefficients  of  the  given  polynomial.  In  solving  this  problem, 
we  obtain  a  method  of  determining  k  and,  in  particular,  a  stability  criterion 
in  a  form  in  which  it  was  established  by  Hurwitz  [204]. 
We  again  consider  the  polynomial 

/  (2)  =  a^^  +  6o2«-i  +  aiZ«-2  -F  \z^-^  +  •  •  •     (ao  v^  0) . 
By  the  Hurwitz  matrix  we  mean  the  square  matrix  of  order  n 


H  = 


..  b 


&o   6i   &2  • 

«0    ^1    ^2   • 

0    bQ  b^  ...  b 
0   Uq  tti  . . 

0    0    6„  ...   6 


n-l 
K-1 


n-2 
^n-2 


n-3 


ttj^^O    for    k> 
bj^=0    for   k> 


(27) 


26  For  this  is  precisely  the  situation  in  planning  new  mechanical  or  electrical  systems 
of  governors. 
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We  transform  the  matrix  by  subtracting  from  the  second,  fourth,  . 
rows  the  first,  third,  .  .  ,  row,  multiplied  by  ao/^o.''    We  obtain  the  matrix 


60  W  62 

0  Co  Ci 

0  K  &i 

0  0  Co 

0  0  6n 


*«-2 


'n-2 


In  this  matrix  Co,  Ci,  . . .  is  the  third  row  of  Routh  's  scheme  supplemented  by 
zeros  (cfc  =  0  for /c  >  [n/2] — 1). 

We  transform  this  matrix  again  by  subtracting  from  the  third,  fifth,  . .  . 
rows  the  second,  fourth,  . . .  row,  multiplied  by  ho/coi 


bo 

h 

&2 

63  ... 

0 

Co 

Cl 

C2  ... 

0 

0 

do 

d,  ... 

0 

0 

Co 

Ci  ... 

0 

0 

0 

do  ... 

0 

0 

0 

Co  ... 

Continuing  this  process,  we  ultimately  arrive  at  a  triangular  matrix  of 
order  n 


R  = 


&0     &i     &2 

0     Co    Ci 

0       0       dn 


(28) 


which  we  call  the  Routh  matrix.  It  is  obtained  from  Routh 's  scheme  (see 
(15) )  by :  1)  deleting  the  first  row;  2)  shifting  the  rows  to  the  right  so  that 
their  first  elements  come  to  lie  on  the  main  diagonal ;  and  3 )  completing  it 
by  zeros  to  a  square  matrix  of  order  n. 


We  begin  by  dealing  with  the  regular  case  where  bo  7^  0,  Coj^  0,  do  7^  0, 
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Definition  2 :  Two  matrices  A  =  ||  aik  ||i  and  B  =  ||  hik  11?  will  he  called 
equivalent  if  and  only  if  for  every  p-^n  the  corresponding  minors  of  order 
p  in  the  first  p  rows  are  equal : 


/I  2....^       /I  2..., 


,   i^,  ' ' .,  I'p  — 1,2,..., 

Since  we  do  not  change  the  values  of  the  minors  of  order  p  in  the  first 
p  rows  when  we  subtract  from  any  row  of  the  matrix  an  arbitrary  multiple 
of  any  preceding  row,  the  Hurwitz  and  Routh  matrices  H  and  R  are  equiva- 
lent in  the  sense  of  Definition  2 : 


H 


1    2 


=  R 


1    2 


1,2, 


.  (29) 


.    "2?    •  •  -J    "p 

'1    "2  •  •  •    "p/  X"!     "2  •  •  •    "p/  \  p^\,  2,  . . .,  n 

The  equivalence  of  the  matrices  H  and  B  enables  us  to  express  all  the 
elements  of  E,  i.e.  of  the  Routh  scheme,  in  terms  of  the  minors  of  the  Hurwitz 
matrix  H  and,  therefore,  in  terms  of  the  coefficients  of  the  given  polynomial. 
For  when  we  give  to  p  in  (29)  the  values  1,  2,  3,  . . .  in  succession,  we  obtain 


H 

0= 

--bo, 

■-Q- 

=  &i, 

H 

\3; 

=  &2>    •  •  •> 

< 

> 

=  Vo» 

/I   2\ 

%  3)= 

=Vi> 

/I 
H 

\1 

2\ 
4/" 

=  V2'    •••> 

H 

c: 

3\ 
3J~ 

=  boCodo, 

H 

/I   2  3\ 
\1   2  4J" 

=  ^O^O^l' 

H 

/I   2 
J   2 

3\ 
5/" 

=  &0^0^2»    •  •  • 

,    (30) 


etc. 

Hence  we  find  the  following  expressions  for  the  elements  of  Routh 's 
scheme : 


'.-(;)• 

'.-©■ 

"--%) 

""  "(1)  ■ 

(31.) 
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The  successive  principal  minors  of  H  are  usually  called  the  Hurwitz 
determinants.    We  shall  denote  them  by 


i.=^(;)-- 


,    ^n  =  H 


A,=H 


1  2 
1  2 


6o    6i  . 

0      b,. 
0      an . 


K-1 


'n-2 


(32) 


Note  1.    By  the  formulas  (30) ,'' 


(33) 


From  Ai^O,  .  .  . ,  zip  7^  0  it  follows  that  the  first  p  of  the  numbers 
ho,  Co,  .  . .  are  different  from  zero,  and  vice  versa ;  in  this  case  the  p  successive 
rows  of  Routh  's  scheme  beginning  with  the  third  are  completely  determined 
and  the  formulas  (31)  hold  for  them. 

Note  2.  The  regular  case  (all  the  bo,  Co,  ...  have  a  meaning  and  are 
different  from  zero)  is  characterized  by  the  inequalities 


zli^O,     zl2^0, 


Any^O. 


Note  3.    The  definition  of  the  elements  of  Routh 's  scheme  by  means  of 
the  formulas  (31)  is  more  general  than  that  by  means  of  Routh 's  algorithm. 

Thus,  for  example,  if  ho  =  hI  j=  0,  then  Routh 's  algorithm  does  not  give  us 

anything  except  the  first  two  rows  formed  from  the  coefficients  of  the  given 
polynomial.    However  if  for  Ai  =  0  the  remaining  determinants  ZI2,  A3,  .  .  . 
are  different  from  zero,  then  by  omitting  the  row  of  c's  we  can  determine 
by  means  of  the  formulas  ( 31 )  all  the  remaining  rows  of  Routh 's  scheme. 
By  the  formulas  (33), 


A,      c,=  ^^, 


d  -^ 


and  therefore 


28  If  the  coefficients  of  f(z)  are  given  numerically,  then  the  formulas  (33) — reducing 
this  computation,  as  they  do,  to  the  formation  of  the  Routh  scheme — give  by  far  the 
simplest  method  for  computing  the  Hurwitz  determinants. 
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V  (ao,  60.  Co, . . .)  =v[ao,  J^, ^,  . . .,  ^)  =  F {a„  A^,  A^, . . .)  +  F(l,  A^,  A^, . .  .)• 

Hence  Routh's  theorem  can  be  restated  as  follows: 

Theorem  4  (Routh-Hurwitz)  :    The  number  of  real  roots  of  the  poly- 
nomial f{z)  =  doZ*"  +  . .  .in  the  right  half -plane  is  determined  hy  the  formula 


^=^K^4:4:'-'2ti) 


(34) 


or  (what  is  the  same)  hy 

k  =  V  (ao,  ^v  ^3,  •  •  •)  +  F  (1,  ZI2,  A^,  . . .).  (340 

Note.  This  statement  of  the  Routh-Hurwitz  theorem  assumes  that  we 
have  the  regular  case 

A^y^O,     A2¥=0,    ...,     An¥^0. 

In  the  following  section  we  shall  show  how  this  formula  can  be  used  in 
the  singular  cases  where  some  of  the  Hurwitz  determinants  Ai  are  zero. 

2.  We  now  consider  the  special  case  where  all  the  roots  oi  f(z)  are  in  the 
left  half -plane  Re  2:  <  0.  By  Routh's  criterion,  all  the  ao,  ho,  Co,  do,  . . .  must 
then  be  different  from  zero  and  of  like  sign.  Since  we  are  concerned  here 
with  the  regular  case,  we  obtain  from  (34)  for  A;  =  0  the  following  criterion : 

Criterion  of  Routh-Hurwitz  :  All  the  roots  of  the  real  polynomial 
fiz)=  aoZ^  +  .  .  .  (aoy<^0)  have  negative  real  parts  if  and  only  if  the  in- 
equalities 

^      s  M  .  an^n>0  {for  odd  n),        1 

aoA>0,    4>0,    ao4>0,    A>0,  ...,    V"     ,    ,  \         1(35) 

^„>0  {for  even  n)        ] 

hold. 

Note.    If  Oo  >  0,  these  conditions  can  be  written  as  follows : 

Ji>0,    Zl2>0,  ...  ,  J„>0.  (36) 

If  we  use  the  usual  notation  for  the  coefficients  of  the  polynomial 

/  (z)  =  ao2"+  aiz""'  +  ag^""'  +  •  •  •  +  ^n-i^  +  ««. 

then  for  ao  >  0  the  Routh-Hurwitz  conditions  (36)  can  be  written  in  the 
form  of  the  following  determinantal  inequalities : 
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«!  I  >  0, 


ai   (Zi 


>0, 


«1 

«3 

«5 

«o 

^2 

«4 

0 

«1 

^3 

>o, 


»1     «3     «5     ••• 

do  a^  a^   .. 
0    tto  aj   ... 

0 
0 
0 
0 

.  a. 

>  0.       (36') 


A  real  polynomial  f(z)  =<ioZ*^  +  . . .  whose  coefficients  satisfy  (35),  i.e., 
whose  roots  have  negative  real  parts,  is  often  called  a  Hurwitz  polynomial. 

3.  In  conclusion,  we  mention  a  remarkable  property  of  Routh  's  scheme. 

Let  /o,  /i, . . .  and  go,  Qi,  •  •  -  be  the  (m  +  l)-th  and  (m  +  2)-th  rows  of  the 
scheme  {fo  =  Am/Am-i,  go  =  Am+i/^m)-  Since  these  two  rows  together 
with  the  subsequent  rows  form  a  Routh  scheme  of  their  own,  the  elements 
of  the  (m  +  p  +  l)-th  row  (of  the  original  scheme)  can  be  expressed  in 
terms  of  the  elements  of  the  (m  +  l)-th  and  (m  +  2)-th  rows  /o,  /i,  .  •  •  and 
go,  gi,  ...  by  the  same  formulas  as  the  (p  +  l)-th  row  can  in  terms  of  the 
elements  of  the  first  two  rows  do,  fli,  . . .  and  ho,  hi,  . . .  ;  that  is,  if  we  set 


9^0    9i    92 
h    /i    h 

0     /o     /i 


H  = 


then  we  have 

1 . . .  w  +  p —  1 
m  +  p —  1 

^/l...m  +  2>— 1\ 
\l...m  +  p—lj 


^/l...w  +  p— 1  m  +  p        \        ~/l...p— 1  p        \ 

\l...m  +  p—l     m-i-p  +  Jc—l)  _      \l...p  —  l     p  +  k—l) 


'<::.l-l) 


(37) 


The  Hurwitz  determinant  Am+p  is  equal  to  the  product  of  the  first  m  +  p 
numbers  in  the  sequence  ho,  Co, . . .  : 


^m+p  —  OqCq  .  . .  fQgQ  ...  Iq. 


But 


^m  =  Kcq  . . .  /o»     Ap^gQ  . ..  Iq. 
Therefore  the  following  important  relation^^  holds : 


^m+p —  AmAp' 


(38) 


29  Here  A^  is  the  minor  of  order  p  in  the  top  left-hand  corner  of  H. 


196    XV.  The  Problem  of  Routh-Hurwitz  and  Related  Questions 

The  formula  (38)  holds  whenever  the  numbers  /o,  /i,  . . .  and  go,  gi,  ... 
are  well  defined,  i.e.,  under  the  conditions  A^-i  ¥=0,  Amy^  0. 

The  formula  (37)  has  a  meaning  if  in  addition  to  the  conditions  Am-ij^O, 
^m  =7^  0  we  also  have  Am+p-i¥=0.  From  this  condition  it  follows  that  the 
denominator  of  the  fraction  on  the  right-hand  side  of  (37)  is  also  different 
from  zero :   Ap-i^O. 


§  7.    Orlando's  Formula 


1.  In  the  discussion  of  the  cases  where  some  of  the  Hurwitz  determinants 
are  zero  we  shall  have  to  use  the  following  formula  of  Orlando  [294],  which 
expresses  the  determinant  zl„_i  in  terms  of  the  highest  coefficient  do  and  the 
roots  01,  Z2,  '",  2f«of  f(z)  :^° 


njn—l)  1,  ....  n 

4-1 =(-1)  '    <-^  n  {z,^z,). 


(39) 


»<* 


For  n  =  2  this  reduces  to  the  well-known  formula  for  the  coefficient  &o 
in  the  quadratic  equation  aoZ^  -h  hoZ  +  ai  =  0 : 

Aj^=bQ  =  —  aQ{zi  +  Z2). 

Let  us  assume  that  the  formula  (39)  is  true  for  polynomials  of  degree  n, 
f{z)=  ttoz"  -\-  hffi^"^  +  . . .  and  show  that  it  is  then  true  for  polynomials  of 
degree  n-^1 

F{z)={z  +  h)f{z) 

=  ao3'^+i  +  (6o  +  hao)  2"  +  {a^  +  h\)  2"-^  -f  •  •  •  (^  =  -z„+i). 

For  this  purpose  we  form  the  auxiliary  determinant  of  order  n  +  \ 


D  = 


bi'"K-i 


0      6, 
0     a 


0  • 


^n-l 
>«-2 


0  •  •  •  "n-2 


0      0 


(-!)• 


0     for    A;  > 


ht  =  0     for    k> 


30  The  coefficients  of  f(z)  may  be  arbitrary  complex  numbers. 
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We  multiply  the  first  row  of  B  by  ao  and  add  to  it  the  second  row  multi- 
plied by  —  &o,  the  third  multiplied  by  ai,  the  fourth  by  —  61,  etc.  Then 
in  the  first  row  all  the  elements  except  the  last  are  zero,  and  the  last  element 
isf{h).    Hence  we  deduce  that 

D  =  {-l)''A„_J{h). 

On  the  other  hand,  when  we  add  to  each  row  of  D  (except  the  last)  the 
next  multiplied  by  h  we  obtain,  apart  from  a  factor  ( — 1)",  the  Hurwitz 
determinant  Al  of  order  n  for  the  polynomial  F(z)  : 


i)  =(-!)« 


Thus 


}q  +  httQ    61  +  ha^ 


«i 


hL 


60  +  httQ 


(-1)"^:. 


Zi;  =  ^„_i/  [h)  =  a,A,_,  n  {h-z,) . 


i=l 


When  we  replace  An-i  by  its  expression  (39)  and  seth  =  —  Zn+i,  we  obtain 

(»+!)»         1,  ...,n+l 

A:={-1)     2      al     n     {z,  +  z,). 

i<k 

Thus,  by  mathematical  induction  Orlando's  formula  is  established  for 
polynomials  of  every  degree. 

From  Orlando's  formula  it  follows  that:  Zl„_i  =  0  if  and  only  if  the  sum 
of  two  roots  of  f{z)  is  zero.^^ 

Since  An  =  cAn-i,  where  c  is  the  constant  term  of  the  polynomial  f(z) 
(c=  ( — l)*^aoZiZ2  . .  .Zn),  it  follows  from  (39)  that: 


n(ra-hl)  1, ...,  n 

^n  =(-  1)       '         <^lH  '"^n    ^     (2i  +  ^k)  • 


(40) 


i<k 


The  last  formula  shows  that:  Zl„  vanishes  if  and  only  if  f{z)  has  a  pair 
of  opposite  roots  z  and  —  z. 


31  In  particular,  J„_i  =  0  when  f{z)  has  at  least  one  pair  of  conjugate  pure  imaginary 
roots  or  multiple  zero  roots. 
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§  8.    Singular  Cases  in  the  Routh-Hurwitz  Theorem 

In  discussing  the  singular  cases  where  some  of  the  Hurwitz  determinants 
are  zero,  we  may  assume  that  Any^O  (and  consequently  An-iy^O). 

For  if  An^O,  then,  as  we  have  seen  at  the  end  of  the  preceding  section, 
the  real  polynomial  f{z)  has  a  root  2:'  for  which  —  z^  is  also  a  root.  If  we  set 
f{z)=Fi{z)  +F2{z),  where 

Fi  (2)  =  «o2"  +  «i2"-'  +  •  •  •,    F^  (2)  =  hz^-^  +  6iZ«-3  +  . . ., 

then  we  can  deduce  from  f{z')=f{—z')=0  that  i^i(e')  =i^2(^')  =0. 
Therefore  z'  is  a  root  of  the  greatest  common  divisor  d{z)  of  the  polynomials 
Fi{z)  and  F2{z).  Setting  f  (z)  =  d{z) f* {z) ,  we  reduce  the  Routh-Hurwitz 
problem  for  f{z)  to  that  for  the  polynomial  p{z)  for  which  the  last  Hurwitz 
determinant  is  different  from  zero. 

1.    To  begin  with,  we  examine  the  case  where 

A,  =  .. 


From  Ai  =  0  it  follows  that  6o  =  0;  from  ^2  = 
follows  that  hi  =  0.    But  then  we  have  automatically 


Any^O. 

0     61 


(41) 
—  0,0^1  =  0  it 


From 


A,= 


A  = 


0 

b^ 

h 

do 

«i 

«2 

0 

0 

&1 

—  a^bl  =  0. 


0  0  &2  63 

«0  «1  «2  ®3 

0  0  0  62 

0  Cfo  «1  ^2 


it  follows  that  &2  =  0  and  then  zlg  =  —  c^m  =  0,  etc. 

This  argument  shows  that  in  (41)  p  is  always  an  odd  number  p  =  2h  —  1. 
Then  &«  =  &i  =  &2  =  •  • .  =  &»-!  =  0,hn¥=  0,  and^^ 

Let  us  vary  the  coefficients  ho,  Z>i,  . . . ,  &7i_i  in  such  a  way  that  for  the 
new,  slightly  altered  values  &o*,  &i*,  •  •  • ,  &*-i  all  the  Hurwitz  determinants 
Ai*,  A2*,  . . . ,  An*  become  different  from  zero  and  zip*  1, ... ,  An*  keep  their 
previous  signs.  We  shall  take  ho*,  hi*,  . . . ,  ht+i  as  'small'  values  of  differ- 
ent orders  of  'smallness' ;  indeed,  we  shall  assume  that  every  hj-i  is  in  abso- 


32  From  (42)  it  follows  that  for  odd  h  sign  Ap^2^^  ( — ^)     ^      sign  Oo,  and  for  even 
h 
h  sign  Ap+i=  i—l)^  . 
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lute  value  'considerably'  smaller  than  hj*  (j  =  l,  2,  . . . ,  h;  hn*  =  hh) •  The 
latter  means  that  in  computing  the  sign  of  an  integral  algebraic  expression 
in  the  h*  we  can  neglect  terms  in  which  some  hi*  have  an  index  less  than 
j  in  comparison  with  terms  where  all  the  hi*  have  an  index  at  least  j. 
We  can  then  easily  find  the  'sign-determining'  terms  of  At,  At,  . . . ,  Ap 
(p  =  2/i  — 1):3« 

etc.;  in  general, 

7  (7+1) 


4-=(-l)    '*?+•••  (?^1'2'  •••'  ^-1)' 


7  0+1) 


4-+i=(-i)    '    4/^^^+'--  (?=0,1,  ...,  A-1). 


(43) 


We  choose  &o*,  &i*,  . . . ,  &2ft-i  as  positive ;  then  the  sign  of  Ai*  is  determined 
by  the  formula 

?o+i)  /      r  '1  \ 

sign4=(-l)     2     signaj  (?  =  [yj,  »  =  1,  2,  . . .,  pj.  (44) 


In  any  small  variation  of  the  coefficients  of  the  polynomial  the  number 
k  remains  unchanged,  because  f{z)  has  no  roots  on  the  imaginary  axis. 
Therefore,  starting  from  (44)  we  determine  the  number  of  roots  in  the 
right  half -plane  by  the  formula 

\        ^1         \    \+ll      ^^p+^       "^"-'^ . 

An  elementary  calculation  based  on  (42)  and  (44)  shows  that 


(«o. 


Ao  J„^i    4      ^  ,      /     ,v..   /     /- 


^:,4 ^,  ^\='^+'-^^\.-.„.LA^A]  m 


A A*  *  A     J~^  2  U==sign kZ-i- 


Note  that  the  value  on  the  left-hand  side  of  (46)  does  not  depend  on  the 
method  of  varying  the  coefficients  and  retains  one  and  the  same  sign  for 
arbitrary  small  variations.  This  follows  from  (45),  because  k  does  not 
change  its  value  under  small  variations  of  the  coefficients. 


33  Essentially  the  same  terms  have  already  been  computed  above  for  A^,  A^,  . . .  ,A„. 
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2.    Suppose  now  that  for  5  >  0 

A,+i  =  '"=A,+p  =  0  (47) 

and  that  all  the  remaining  Hurwitz  determinants  are  different  from  zero. 
We  denote  by  So,  Si,  ...  and  bo,  &i,  ...  the  elements  of  the  (s  +  l)-th 
rows  in  Routh's  scheme  (So  =  As./^s-i,  &o  =  As+i/As).    We  denote  the  cor- 
responding determinants  by  Ai,  A2,  . . . ,  Ans.    By  formula  (38)  (p.  195), 

As+i  =  AgAi,  . . . ,  As+p  —  AgAp,    As+p+i  =  AgAp+i,   As+p+2  —  AsAp+2 •       (48) 

Then  by  1.  it  follows  that  p  is  odd,  say  p  =  2h  —  1.^* 

Let  us  vary  the  coefficients  oi  f(z)  in  such  a  way  that  all  the  Hurwitz 
determinants  become  different  from  zero  and  that  those  that  were  different 
from  zero  before  the  variation  retain  their  sign.  Since  the  formula  (46)  is 
applicable  to  the  determinants  A,  we  then  obtain,  starting  from  (48)  : 


\  *''    ''  A,_J  ^      \A,_i*    As  As+p+il  ^      \As+p+i  A„-il 


The  value  on  the  left-hand  side  of  (49)  again  does  not  depend  on  the  method 
of  variation. 

3.  Finally,  let  us  assume  that  among  the  Hurwitz  determinants  there  are 
V  groups  of  zero  determinants.  We  shall  show  that  for  every  such  group 
(47)  the  value  on  the  left-hand  side  of  (49)  does  not  depend  on  the  method 
of  variation  and  is  determined  by  that  formula.^^  We  have  proved  this 
statement'  for  v  =  1.  Let  us  assume  that  it  is  true  for  v  —  1  groups  and 
then  show  that  it  is  also  true  for  v  groups.  Suppose  that  (47)  is  the  second 
of  the  V  groups;  we  determine  Ai,  A2  in  the  same  way  as  was  done  under  2. ; 
then  for  this  variation 


34  In  accordance  with  footnote  32,  for  p=:2h  —  1  and  odd  h, 

M-l 
signZl,.^.p+2=  ( — 1)   2  sign^5_i; 

and  for  even  h, 

h 
sign  As +  p  +  i=  (—1)2  signZl^, 

35  From  (47)  and  zl^  y^  0,As+p+i  v^  0  it  follows  by  (48)  and  (42)  that  As^i  ¥=  0, 
As  +  p+2^0. 
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Since  we  have  only  v  —  1  groups  of  zero  determinants  on  the  right-hand  side 
of  this  equation,  our  statement  holds  for  the  right-hand  side  and  hence  for 
the  left-hand  side  of  the  equation.  In  other  words,  the  formula  (49)  holds 
for  the  second,  . . . ,  v-th  group  of  zero  Hurwitz  determinants.  But  then  it 
follows  from  the  formula 

^  Ai  A„_i/ 

that  the  value  of  V  (3^,  ^7-'  ^i'  •  •  •'  ^S^)  ^^^^  ^^^  depend  on  the 
method  of  variation  for  the  first  group  of  zero  determinants,  and  therefore 
that  (49)  holds  for  this  group  as  well. 

Thus  we  have  proved  the  following  theorem : 

Theorem  5  :  If  some  of  the  Hurwitz  determinants  are  zero,  hut  An  ^  0, 
then  the  number  of  roots  of  the  real  polynomial  f{z)  in  the  right  half -plane 
is  determined  by  the  formula 

"='{""'4 £r) 

in  which  for  the  calculation  of  the  value  of  V  for  every  group  of  p  successive 
zero  determinants  (p  is  always  odd!) 

{As¥^0)  As+i  =  '--  =  As+p=0    (J.+p+i^O) 
we  have  to  set 

\As-i     As  Ag+p+ij  2  ' 


where^^ 


P  =  2h-l      and   e  =  sign(-^i^±^] 

\As-l  As+p+1/ 


r 
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Number  of  Distinct  Real  Roots  of  a  Polynomial 

Routh  obtained  his  algorithm  by  applying  Sturm's  theorem  to  the  computa- 
tion of  the  Cauchy  index  of  a  regular  rational  fraction  of  special  type  (see 
formula  (10)  on  p.  178) .    Of  the  two  polynomials  in  this  fraction— numera- 


For  s=l  -^-~  is  to  be  replaced  by  A^ ;  and  for  s  ^  0,  by  Oo. 
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tor  and  denominator — one  contains  only  even,  the  other  only  odd  powers  of 
the  argument  z. 

In  this  and  in  the  following  sections  we  shall  explain  the  deeper  and 
more  comprehensive  method  of  quadratic  forms,  due  to  Hermite,  in  its 
application  to  the  Routh-Hurwitz  problem.  By  means  of  this  method  we 
shall  obtain  an  expression  for  the  index  of  an  arbitrary  rational  fraction 
in  terms  of  the  coefficients  of  the  numerator  and  denominator.  The  method 
of  quadratic  forms  enables  us  to  apply  the  results  of  Frobenius'  subtle  inves- 
tigations in  the  theory  of  Hankel  forms  (Vol.  I,  Chapter  X,  §  10)  to  the 
Routh-Hurwitz  problem  and  to  establish  a  close  connection  of  certain  re- 
markable theorems  of  Chebyshev  and  Markov  with  the  problem  of  stability. 

1.  We  shall  acquaint  the  reader  with  the  method  of  quadratic  forms  first 
in  the  comparatively  simple  problem  of  determining  the  number  of  distinct 
real  roots  of  a  polynomial. 

In  the  solution  of  this  problem  we  may  restrict  ourselves  to  the  case 
where  i{z)  is  a  real  polynomial.  For  suppose  that  j{z)  =u(z)  +iv(z)  is 
a  complex  polynomial  (uiz)  and  v(z)  being  real  polynomials).  Each  real 
root  of  f{z)  makes  u(z)  and  v(z)  vanish  simultaneously.  Therefore  the 
complex  polynomial  f(z)  has  the  same  real  roots  as  the  real  polynomial  d{z)j 
the  greatest  common  divisor  of  u{z)  and  v{z). 

Thus,  let  /( 2!)  be  a  real  polynomial  with  the  distinct  roots  ai,  02,  . . . ,  Og 
of  the  respective  multiplicities  ni,  n2,  . . . ,  rigi 

f  (z)  =  ao{z- a, r^  {z - a^r^  -"{z-  af' 

(ttoT^O;  a^v^a^i,     for    iy^k;   i,k  =  l,2,  .  . .,  q) . 

We  introduce  Newton's  sums 

Sp=2Jn.oc^     {p  =  0,l,2,  ...). 

With  these  sums  we  form  the  Hankel  forms 

n— 1 

where  n  is  an  arbitrary  integer,  n'^q. 
Then  the  following  theorem  holds : 

Theorem  6 :  The  numher  of  all  the  distinct  roots  of  f(z)  is  equal  to  the 
rank,  and  the  number  of  all  the  distinct  real  roots  to  the  signature,  of  the 
form  Sn{x,x). 
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Proof.    From  the  definition  of  the  form  Sn(x,  x)  we  immediately  obtain 
the  following  representation : 


S„  {x;  x)=2J '^j  (^0  +  ^i^\  +  a^Xg  H h  a"' 


(51) 


Here  to  each  root  a,  of  f{z)  there  corresponds  the  square  of  a  linear  form 
Zj  =  Xo  +  ttjXi  +  .  .  .  +  a^-'^Xn-i  (j  =  l,2,...,q).  The  forms  Zi,  Zg, .  .  . ,  Z^ 
are  linearly  independent,  since  their  coefficients  form  the  Vandermonde 
matrix  11  a/  11  whose  rank  is  equal  to  the  number  of  distinct  aj,  i.e.,  to  q. 
Therefore  (see  Vol.  I,  p.  297)  the  rank  of  the  form  Sn{x,  x)  is  q. 

In  the  representation  (51)  to  each  real  root  ay  there  corresponds  a  posi- 
tive square.  To  each  pair  of  conjugate  complex  roots  aj  and  oy  there  cor- 
respond two  complex  conjugate  forms: 

the  corresponding  terms  in  (51)  together  give  one  positive  and  one  negative 
square : 


^A'  + 


i.Z';=2n.P^-2nfil 


Hence  it  is  easy  to  see^''  that  the  signature  of  Sn(x,  x),  i.e.,  the  difference 
between  the  number  of  positive  and  negative  squares,  is  equal  to  the  number 
of  distinct  real  a,-. 

This  proves  the  theorem. 

2.  Using  the  rule  for  determining  the  signature  of  a  quadratic  form  that 
we  established  in  Chapter  X  (Vol.  I,  p.  303),  we  obtain  from  the  theorem  the 
following  corollary: 

Corollary:  The  number  of  distinct  real  roots  of  the  real  polynomial 
f{z)  is  equal  to  the  excess  of  permanences  of  sign  over  variations  of  sign  in 
the  sequence 

)  5i    .  .  .    5. 


Ij  "^0' 


'n-1 


'n-1 


'2/1-2 


(52) 


where  the  5p  (p  =  0,  1,  . . .)  are  Newton's  sums  for  f(z)  and  n  is  any  integer 
not  less  than  the  number  q  of  distinct  roots  of  f(z)  (in  particular,  n  can  he 
chosen  as  the  degree  of  f{z)). 


37  The  quadratic  form  Sn{,x,x)  is  representable  as  an  (algebraic)  sum  of  q  squares  of 
the  real  forms  Z)  (for  real  (tj)  and  F}  and  Q}  (for  complex  clj).  These  forms  are  linearly 
independent,. since  the  rank  of  Sn(x,x)  is  q. 


204    XV.  The  Problem  of  Routh-Hurwitz  and  Related  Questions 


This  rule  for  determining  the  number  of  distinct  real  roots  is  directly- 
applicable  only  when  all  the  numbers  in  (52)  are  different  from  zero.  How- 
ever, since  we  deal  here  with  the  computation  of  the  signature  of  a  Hankel 
form,  by  the  results  of  Vol.  I,  Chapter  X,  §  10,  the  rule  with  proper  refine- 
ments remains  valid  in  the  general  case  (for  further  details  see  §  11  of  that 
chapter). 

From  our  theorem  it  follows  that :  All  the  forms 

8n{x,x)     {n=q,q+l,  . . .) 

have  the  same  rank  and  the  same  signature. 

In  applying  Theorem  6  (or  its  corollary)  to  determine  the  number  of 
distinct  real  roots,  we  may  take  n  to  be  the  degree  of.  f{z). 

The  number  of  distinct  real  roots  of  the  real  polynomial  f{z)  is  equal  to 


the  index/-''"  4^ 
—  f{z) 

the  formula 


(see  p.  175) .    Therefore  the  corollary  to  Theorem  6  gives 


/i:^=»-2F|i,.„, 


«0        «! 
Si        So 


•n-1 


5«    ...  S^ 


.   S 


2n-2 


where  Sp=  ^  n^a^  {p  =  0,  1,  . . .)  are  Newton's  sums  and  n  is  the  degree 

oif(z).       ''' 

In  §  11  we  shall  establish  a  similar  formula  for  the  index  of  an  arbitrary 
rational  fraction.  The  information  on  infinite  Hankel  matrices  that  will  be 
required  for  this  purpose  will  be  given  in  the  next  section. 
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1.   Let 


),    Sl,    S2y 


be  a  sequence  of  complex  numbers.    This  determines  an  infinite  symmetric 
matrix 


S  = 


Sq    Si    S2 
Sl    S2    53 

^2      *3      *4 
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which  is  usually  called  a  Hankel  matrix.  Together  with  the  infinite  Hankel 
matrices  we  shall  consider^®  the  finite  Hankel  matrices  Sn  =  11  Si+k  11  J~^ 
and  their  associated  Hankel  forms 

n— 1 

S„{X,X)=   2J  Si+^XiX^. 
i,k=0 

The  successive  principal  minors  of  8  will  be  denoted  by  Di,  D2,  Z>3,  •  •  • 

Infinite  matrices  may  be  of  finite  or  of  infinite  rank.  In  the  latter  case, 
the  matrices  have  non-zero  minors  of  arbitrarily  large  order.  The  following 
theorem  gives  a  necessary  and  sufficient  condition  for  a  sequence  of  numbers 
Soj  Si,  $2,  . . .  to  generate  an  infinite  Hankel  matrix  S=  11  Si+fc  11  ^  of  finite 
rank. 

Theorem  7 :  The  infinite  matrix  S=  \\  Si+ kll^  is  of  finite  rank  r  if  and 
only  if  there  exist  r  numbers  ai,  «£,  . . . ,  ar  such  that 


=  2J<XgSq-g       {q  =  r,r+l,...)  (53) 


and  r  is  the  least  number  having  this  property. 

Proof.  If  the  matrix  ^=  11  Si+k  11*  has  finite  rank  r,  then  its  first 
r  +  1  rows  Ri,  B2,  . . . ,  Rr+i  are  linearly  dependent.  Therefore  there  exists 
a  number  h'^r  such  that  Bi,  R2,  . . . ,  Rh  are  linearly  independent  and  Rh+i 
is  a  linear  combination  of  them : 

h 

We  consider  the  rows  Rg+i,  Rq+2,  -  • ,  Rq+h+i,  where  q  is  any  non- 
negative  integer.  From  the  structure  of  S  it  is  immediately  clear  that  the 
raws  Rq+i,  Rq+2,  -  •  - ,  Rq+n+i  are  obtained  from  Ri,  R2,  . . . ,  Rh+i  by  a 
*  shortening '  process  in  which  the  elements  in  the  first  q  columns  are  omitted. 
Therefore 

h 

Rq+h+l  =2J<^gRq+h-g+l  {q=0,l,  2,    ...). 

g=l 

Thus,  every  row  of  S  beginning  with  the  {h  +  l)-th  can  be  expressed  linearly 
in  terms  of  the  h  preceding  rows  and  therefore  in  terms  of  the  linearly 


38  See  Vol.  I,  Chapter  X,  §  10. 
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independent  first  h  rows.     Hence  it  follows  that  the  rank  of  S  is  r  =  h.^^ 
The  linear  dependence 

h 

^q+h+1  ^^   ^g  ^q+h-g+1 
9=1 

after  replacement  of  /i  by  r  and  written  in  more  convenient  notation 
yields  (53). 

Conversely,  if  (53)  holds,  then  every  row  (column)  of  >S^  is  a  linear  com- 
bination of  the  first  r  rows  (columns).  Therefore  all  the  minors  of  S  whose 
orders  exceed  r  are  zero  and  S  is  of  rank  at  most  r.  But  the  rank  cannot  be 
less  than  r,  since  then,  as  we  have  already  shown,  there  would  be  relations 
of  the  form  (53)  with  a  smaller  value  than  r,  and  this  contradicts  the  second 
condition  of  the  theorem.    The  proof  of  the  theorem  is  now  complete. 

Corollary:  If  the  infinite  Hankel  matrix  S=\\  Si+k  11  *  is  of  finite 
rank  r,  then 

For  it  follows  from  the  relations  (53)  that  every  row  (column)  of  S  is 
a  linear  combination  of  the  first  r  rows  (columns).  Therefore  every  minor 
of  ;S^  of  order  r  can  be  represented  in  the  form  aDr,  where  a  is  a  constant. 
Hence  it  follows  that  Dr  ^  0. 

'Note.    For  finite  Hankel  matrices  of  rank  r  the  inequality  Dry^O  need 

Sq     Si 


for  So  =  Si  =  0,  S2  =7^  0  is  of  rank  1, 


not  hold.     For  example  /S^2  = 

whereas  Di  =  So  =  0. 

2.    We  shall  now  explain  certain  remarkable  connections  between  infinite 
Hankel  matrices  and  rational  functions. 
Let 

be  a  proper  rational  fractional  function,  where 

h  {z)  =  a^z-^  +  •  •  •  +  a^      (aov^O),       g{z)  =  b^z"^-^  +  b^z"^-^  +  "'i-h^. 
We  write  the  expansion  oiR(z)  in  a  power  series  of  negative  powers  of  z : 


39  The  statement  *  The  number  of  linearly  independent  rows  in  a  rectangular  matrix  is 
equal  to  its  rank'  is  true  not  only  for  finite  rows  but  also  for  infinite  rows. 
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If  all  the  poles  oiR{z),  i.e.,  all  the  values  of  z  for  which  R{z)  becomes  infinite, 
lie  in  the  circle  |  2:  |  ^  a,  then  the  series  on  the  right-hand  side  of  the 
expansion  converges  for  |  2;  |  >  a..  We  multiply  both  sides  by  the  denomi- 
nator h{z)  : 

(ao2™  +  aje"-' +  •■•  + aj  (^  +  |i  +  || +•••)=  612"-' +  ft^z^-^ +•••  + 6„ . 

Equating  coefficients  of  equal  powers  of  z  on  both  sides  of  this  identity, 
we  obtain  the  following  system  of  relations : 


(54) 


agSg  +  ttiVi  +  •  •  •  +  <^mh-m  =  ^  (g  =  w,  m  +  I,  .  .  .).  (54') 

Setting 

aj,=  — ^       (9^=1,2,  ...,  m), 

we  can  write  the  relations  (54')  in  the  form  (53)  (for  r^=m).    Therefore, 
by  Theorem  7,  the  infinite  Hankel  matrix 

^  =  hi+k\\o 

formed  from  the  coefficients  So,  Si,  $2,  . . .  is  of  finite  rank  (^  m). 

Conversely,  if  the  matrix  ^  =  11  Si+fc  IL*  is  of  finite  rank  r,  then  the  rela- 
tions (53)  hold,  which  can  be  written  in  the  form  (54')  (for  m  =  r).  Then, 
when  we  define  the  numbers  fei,  &2,  •  •  • ,  &m  by  the  equations  (54)  we  have  the 
expansion 

a^z"^  +  ajz"*-!  H \-am        z   '^  z^  ^         * 

The  least  degree  of  the  denominator  m  for  which  this  expansion  holds  is 
the  same  as  the  least  integer  m  for  which  the  relations  (53)  hold.  By  Theo- 
rem 7,  this  least  value  of  m  is  the  rank  of  ^==11  Si+fc  11  *• 

Thus  we  have  proved  the  following  theorem : 

Theorem  8:  The  matrix  S=\\  Si+jc\\°°  is  of  finite  rank  if  and  only  if 
the  sum  of  the  series 

is  a  rational  function  of  z.    In  this  case  the  rank  of  S  is  the  same  as  the 
number  of  poles  of  Biz),  counting  each  pole  with  its  proper  multiplicity. 
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§11.    Determination  of  the  Index  of  an  Arbitrary  Rational  Fraction 
by  the  Coefficients  of  Numerator  and  Denominator 

1.    Suppose  given  a  rational  function.    We  write  its  expansion  in  a  series 
of  descending  powers  of  z:^^ 

B  (z)  =  s_^_^z^  +  •  •  •  +  s_^z  +  5_i  +  y  +  |i  H-  •  •  • .  (55) 

The  sequence  of  coefficients  of  the  negative  powers  of  z 

^0'  ^l»  *2'   •  •  • 

determines  an  infinite  Hankel  matrix  S=^  11  ^i+fc  11  *. 

II      ^     llo 

We  have  thus  established  a  correspondence 

B{z)  -  8. 

Obviously  two  rational  functions  whose  difference  is  an  integral  func- 
tion correspond  to  one  and  the  same  matrix  S.  However,  not  every  matrix 
^=||5i+fc||*  corresponds  to  some  rational  function.  In  the  preceding 
section  we  have  seen  that  an  infinite  matrix  S  corresponds  to  a  rational 
function  if  and  only  if  it  is  of  finite  rank.  This  rank  is  equal  to  the  number 
of  poles  of  E{z)  (multiplicities  taken  into  account),  i.e.,  to  the  degree  of 
the  denominator  f{z)  in  the  reduced  fraction  g(z) /f{z)  ^=R(z).  By  means 
of  the  expansion  (55)  we  have  a  one-to-one  corespondence  between  proper 
rational  functions  Biz)  and  Hankel  matrices  ^=11  Si+k  11  *  of  finite  rank. 

We  mention  some  properties  of  the  correspondence: 

1.  If  Bi{z)  ^  8i,  Bziz)  '^  ^2,  then  for  arbitrary  numbers  Ci,  C2 

CiBi{z)  +  C2B2(z)   ^  CiSi  +  C2S2. 

In  what  follows  we  shall  have  to  deal  with  the  case  where  the  coefficients 
of  the  numerator  and  the  denominator  oiB(z)  are  integral  rational  functions 
of  a  parameter  a;  B  is  then  a  rational  function  of  z  and  a.  From  the  expan- 
sion (54)  it  follows  that  in  this  case  the  numbers  So,  Si,  S2,  .  . . ,  i.e.,  the  ele- 
ments of  S,  depend  rationally  on  a.  Differentiating  (55)  term  by  term 
with  respect  to  a,  we  obtain : 

2.  UB{z,a)^S{a),then^^^^~^^ 


^0  The  series  (55)  converges  outside  every  circle  (with  center  at  0^0)  containing  all 
the  poles  of  R(z). 

"  If  5=  II  S.+.  lir ,  then  ^  =  ll-g-*C  • 
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2.    Let  us  write  down  the  expansion  of  B{z)  in  partial  fractions: 

where  ^(2;)  is  a  polynomial;  we  shall  show  how  to  construct  the  matrix  8 
corresponding  to  B{z)  from  the  numbers  a  and  A. 

For  this  purpose  we  consider  first  the  simple  rational  function 


1     _   ^^ 
It  corresponds  to  the  matrix 

««=ii«'+'ii„~- 

The  form  Sani^,^)  associated  with  this  matrix  is 
n— 1 
>S«„ {x, x)  =2J  a*+* Xi x^={xQ  +  ax^  +  '''  +  a""' x„_i)^ . 


If 


g     Ai) 

7=1  J 


then  by  1.  the  corresponding  matrix  S  is  determined  by  the  formula 


? 


,=1  y=l 

and  the  corresponding  quadratic  form  is 


In  order  to  proceed  to  the  general  case  (56),  we  first  differentiate  the 
relation 


h  —  1  times  term  by  term.    By  1.  and  2.,  we  obtain : 


1  1       a'i-i  So 


(z  — a)A  {h  —  l)l    daJ^-^ 


+  fc' 


(;rxy--ir,(:ro-ofor^+.<.-i 
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Therefore,  by  using  rule  1.  again  we  find  in  the  general  case,  where  R{z) 
has  the  expansion  (56)  : 

B(z)  ~  S=|(^«.+  ^0,^  +  ...  +  ^^«.g^)s,,.         (57) 
By  carrying  out  the  differentiation,  we  obtain : 

s=  \\i:Af  ot;" + Af  i'f)  «r'-^ +•••+<  (^;-*i)  «^*-''-+'  ii„-. 

n-l 

The  corresponding  Hankel  form  Sn(x,x)=    2J  ^i+k^t^k  is 

3.    Now  we  are  in  a  position  to  enunciate  and  prove  the  fundamental 
theorem  :*2  I 

Theorem  9 :    If  j 

cmd  m  is  the  rank  of  S,^^  then  the  Cauchy  index  7+^  R(z)  is  equal  to  the        \ 
signature^*  of  the  form  Sn{x,x)  for  any  n^m:^  ^  ? 

Proof.    Suppose  that  the  expansion  (56)  holds.    Then,  by  (57),  | 

J.  ^ 

;=i       ^  [ 

where  each  term  is  of  the  form  | 

I 

^«  =  (^i  +  ^2^  +  "-  +  (^A|^)^a,     S^=\\a^-^^\\-  (58) 

and 

9 

Sn  {x,  x)^2J  T,i  {x,  x)=   2J   Taj  [x,  x)  ^     J^     [T^j  {x,  x)  +  T^, {X,  x)-\ 
7  =  1  a .  real  a    complex  ' 


^2  This  theorem  was  proved  by  Hermite  in  1856  for  the  simplest  case  where  B{z)  has 
no  multiple  poles  [187].  In  the  general  case  it  was  proved  by  Hurwitz  [204]  (see  also 
[25],  pp.  17-19).     The  proof  in  the  text  differs  from  Hurwitz'  proof. 

*3  As  we  have  already  mentioned,  m  is  the  degree  of  the  denominator  in  the  reduced 
representation  of  the  rational  fraction  E{3)   (see  Theorem  8  on  p.  207). 

**  We  denote  the  signature  of  Sn(x,x)  by  a[Sn(x,x)]. 
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By  Theorem  8,  the  rank  of  the  matrix  Tap  and  hence  of  the  form  Taj{x,  x), 

9 

is  vj  (i  =  1,  2,  .  . . ,  q)  and  the  rank  of  Snix,  x)  is  171  =  ^  Vj.     But  if  the 

rank  of  the  sum  of  certain  real  quadratic  forms  is  equal  to  the  sum  of  the 
ranks  of  the  constituent  forms,  then  the  same  relation  holds  for  the 
signatures : 

a[SJx,x)]=   2J  o[Taj{x,x)]+     Z     a\_Ta.^[x,x)-^T^.',x,x)-\.     (59) 
a^-  real  ay  complex 

We  consider  two  cases  separately : 

1)  a  is  real.    Under  any  variation  of  the  parameters  Ai,  A^,  .  . . ,  A^-x 
and  a  in 


^1 


z  —  a       (z  —  a) 


+ 


Av 


(2 -a) 


(60) 


the  rank  of  the  corresponding  matrix  To.  remains  unchanged  (=  v)  ;  there- 
fore the  signature  of  Ta{x,x)  also  remains  unchanged  (see  Vol.  I,  p.  309). 
Therefore  cf\Ta{x,x)'\  does  not  change  if  we  set  in  ( 59 )  and  ( 60 )  :  Ai  =  . . .  = 
A;;_i  =  0  and  a  =  0,  i.e.,  if  for  To.  we  take  the  matrix 


v-\ 

A. 

0 

0    .  .  . 

0     0  .  .  . 

0 

0 

• 

'• 

1      a^-i/Sa  _ 
(v— 1)!  aa"-! 

0      • 
A.        ' 
0 
0 

• 

The  corresponding  quadratic  form  is  equal  to 

2Av  {x^Xv—\  +  x^Xy—1  ^ V  x,—xx^  for  v  =  25 , 

Av  [2  {x^v-Y  +  •  •  •  +  x,_^x^  +  a;,'_i]  f  or  r  =  25  -  1 


(«  =  1,  2,  3, 
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But  the  signature  of  the  upper  form  is  always  zero  and  that  of  the  lower 
form  is  sign  A„.    Thus,  if  a  is  real,  then 


a[Ta{x,x)]  =  \    .       ,     ,         ,,  (61) 

I  sign  Av, 


for  even  v 
sign  Av,  for  odd  v 


2)  a  is  complex. 


Ta  {X,  x)  =  2J  {Pjt  +  iQic)\     Taj  {X,  x)  =  J!  (P,  -  iQ,)^, 

k=l  k=l 

where  Pjc,  Qk  {k  =  l,  2,  . . .  y  v)  are  real  linear  forms  in  the  variables  Xo,  xi, 
X2,  . . .  y  Xn-i.    Then 

V  V 

TAx,x)  +  Ta{x,x)  =  2  2J  PI-2  2JQI  (62) 

k=l  k=l 

Since  the  rank  of  this  quadratic  form  is  2i/,  the  P^,  §&  (A;  =  1,  2,  .  .  . ,  v)  are 
linearly  independent,  so  that  by  (62)  for  a  complex  a 

G[Ta{x,x)  +  T^{x,x)]=0.  (63) 

From  (59),  (61),  and  (63)  it  follows  that 

olS,{x,x)]=     2J      sign  4^). 
/   a/  real  \ 
^    V    odd  i 

But  on  p.  175  we  saw  that  the  sum  on  the  right-hand  side  of  this  equation 
is  /^^  B{z).    This  completes  the  proof. 
From  this  theorem  we  deduce : 

Corollary  1:    If  B{z)  ^  S=\\  Si+jc\\^  and  m  is  the  rank  of  S,  then 

n— 1 

all  the  quadratic  forms  8n{x,x)  =    ^  Si^j^  x^Xj^  {n=^m,  m  +  1,  . . .)  have 

i,k=Q 

one  and  the  same  signature. 

In  Chapter  X,  §  10  (Vol.  I,  pp.  343-44)  we  established  a  rule  for  comput- 
ing the  signature  of  a  Hankel  form;  moreover,  Frobenius'  investigations 
enabled  us  to  formulate  a  rule  that  embraces  all  singular  cases.     By  the 


^^  Each  of  the  products  XnX-,.\,  .riav-2,  . . .  can  be  replaced  by  a  difference  of  squares 

I         2         )  \         2         ]  '    \         2         )  \         2         ] 

All  the  squares  so  obtained  are  linearly  independent. 
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theorem  above  we  can  apply  this  rule  to  compute  the  Cauchy  index.    Thus 
we  obtain : 

Corollary  2 :  The  index  of  an  arbitrary  rational  function  E{z)  whose 
corresponding  matrix  S=  ||  Si+k  ||  *  is  of  ramk  m,  is  determined  by  the 
formula 

IlZIt{z)  =  m-2V  (1,  2)i,  Dg,  . . .,  DJ,  (64) 

where 

St,...  Sj_^ 


J^/^K-4-*Ko-' 


«0 


52  ...  -5/ 


(/  =  1,2,  ...,m); 


(65) 


5/_l       Sf  ...  52/_2 

if  among  Di,  D2,  . . . ,  Dm  there  is  a  group  of  vanishing  determinants'^ 

{Dn^O)     D,^,=  "-  =  0,^^  =  0     (D,+p+i^O), 

then  in  the  computation  of  V{Dft,  D^^+i,  .  .  . ,  jD^+p+i)  ^^  cc^n  take 

sign  J^h+j  =(—  1)  '~V-  sign  D^     {j  =  1,2,...,  p) 
and  this  gives 


2      for  odd  p, 
p+1  — e 


for  even  p  and  e=  (—  1)  2  sign 


D 


(66) 


A+p+l 


In  order  to  express  the  index  of  a  rational  function  in  terms  of  the 
coefficients  of  the  numerator  and  denominator  we  shall  require  some  addi- 
tional relations. 

First  of  all,  we  can  always  represent  B{z)  in  the  form*^ 

R[z)  =  Q(z)+lfy 
where  Q{z),  g{z),h{z)  are  polynomials  and 

h  (z)  =a^^  +  a^z^-^  +  •  •  •  +  a^  (ao^O),  g  (z)  =h^z^  +  h^z^-^  +  • .-  +  6„. 
Obviously,  _,,x 


4®  Here  we  always  have  Dm  7^  0  (p.  206). 

*'^  It  is  not  necessary  to  replace  B{z)  by  a  proper  fraction.     For  what  follows  it  is 
sufficient  that  the  degree  ot  g(z)  does  not  exceed  that  of  h(z). 
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Let 

If  we  now  get  rid  of  the  denominator  and  then  equate  equal  powers  of  z  on 
the  two  sides  of  the  equation,  we  obtain : 


Vm-l  +  «l«m-2  +  •  •  •  +  am«-l  =^m . 

Vt+  ai5(_i  +  •  •  •  +  arnSt-m  =^       (<  =^»  TTl  +  I,  .  .  ,)  . 


(67) 


Using  (67),  we  find  an  expression  for  the  following  determinant  of  order 
2p  in  which  we  put  %  =  0,  hj  =  0  for  j  ^  m: 


«o        ^1        ^2 


2p-l 


2p-2 


6o  6i  62  •  •  •  ^; 
0  c^o  a^^  . . .  a, 
0       60      61  . . .  6j 


0 


»-i 


0  ...0 

^1  .  .  .  •^2p-2 

0  ...0 


5_i        5n 


>2/?-8 


An      ^1       (In  .  .  .  a 


2p-] 


0      Co     ai  . 


1  •  •  •  a2p_2 

0      0      ttQ. ..  a2p_3 


0      0      0  .,.a, 


p(p-i) 
=(- 1)      ' 


5p_2       *p-l  •  •  •  *' 


/)-2 
2p-3 


5n 


«i 


.  .  S^ 


'0  ^1  •  •  •  ^p-1 

We  introduce  the  abbreviation 


—  "0 


V-i 


.  .5, 


>2p-2 


=  alPD^.  (68) 


P«»,= 


CTq     «i  . . .  a 
60     fei  . . .  6. 


2p-l 
2p-l 


0      ttQ  . . .  a2p_2 

0         ^0  •  •  •  *2p-2 


{p  =  l,2,  . ..;  ay=6,  =0  for />m). 


(69) 


Then  (68)  can  be  written  as  follows: 

V,j,=  alPDj,     (p  =  l,2,  ...).  (68') 

By  this  formula,  Corollary  2  above  leads  to  the  following  theorem : 
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Theorem  10:    If  V^^¥^Q,^^  then 

where  V ^    (p  =  1,  2,  . . . ,  m)  is  determined  hy  (69)  ;  if  there  is  a  group  of 
zero  determinants 

(f^2AV^0)        ^2/.+2=---  =  ^2A+2p  =  0       (^2A+2p4-2V^0), 

then  in  computing  V  {V ^n,  ^2h+2>  •  •  •»  ^2h+2p-i-2)^^  ^^'^^  ^^  ^^^  • 

y(y-i) 
signF2/,+2;  =  (-l)      '      sign^aA     {j=l,  2,  .  . .,  p) 

or,  what  is  the  samfie, 


y  {^ 2h*  '  '  •»  F2A+2P+2) 


— 2~      ^^^  ^^  ^ 


^-±4^=^    /o^  e^^^  p  atid  £  =  (-1)1  sign  Ei^^2P+2  ^ 

Note.  If  Fgm  7^  ^>  ^-^M  if  the  fraction  under  the  index  sign  in  (70)  is 
reducible,  then  (70)  must  be  replaced  by  another  formula 

where  r  is  the  number  of  poles  (including  multiplicities)  of  the  rational 
fraction  under  the  index  sign  (i.e.,  r  is  the  degree  of  the  denominator  in  the 
reduced  fraction). 

For  in  this  case  the  index  we  are  interested  in  is 

r-2V{l,D„D^,...,D,), 

since  r  is  the  rank  of  the  corresponding  matrix  S=  11  Si+k  11".  But  the 
equation  (68')  is  of  a  formal  character  and  also  holds  for  reduced  fractions. 
Therefore 

V  (1,  D„  2)2,  ...,D,)  =  V  (1,  ^2,  P4,  .  . .,  17  2,) , 

and  we  have  reached  (70')- 

Formula  (70')  enables  us  to  express  the  index  of  every  rational  fraction 
in  which  the  degree  of  the  numerator  does  not  exceed  that  of  the  denominator 
in  terms  of  the  coefficients  of  numerator  and  denominator. 


**  The  condition  ^201  7^  ^  means  that  Dm  ^  0,  so  that  the  fraction  under  the  index 
sign  in  (70)  is  reduced. 
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§  12.     Another  Proof  of  the  Routh-Hurwitz  Theorem 

1.  In  §  6  we  proved  the  Routh-Hurwitz  theorem  with  the  help  of  Sturm's 
theorem  and  the  Routh  algorithm.  In  this  section  we  shall  give  an  alterna- 
tive proof  based  on  Theorem  10  of  §  11  and  on  properties  of  the  Cauchy 
indices. 

We  mention  a  few  properties  of  the  Cauchy  indices  that  will  be  required 
in  what  follows. 

1.  faB{x)=  —  HB{x).''^ 

2.  IlEx{x)B(x)  =signi?i(ic)/*22(ic)  if  Ri{x)  v^  0,  oo  within  the  inter- 
val (a,  h). 

3.  If  a<c<h,  then  l\  R  {x)  =rR  {x)  +  fB  {x)  +  rj,,  where  f]c  =  0  if 
B{c)  is  finite  and  rjc=±l  if  B(x)  becomes  infinite  at  c;  here  r]c=^+l 
corresponds  to  a  jump  from  —  oo  to  +  oo  at  c  {for  increasing  x) ,  and  rjc= — 1 
to  a  jump  from  +  oo  to  —  oo. 

4.  If  B(—x)=  —  B(x),thenI^_^Bix)=IiBix).  If  B(— x)  =  B(x), 
thenI\B{x)  =  —  IlB{x). 

5.  rB{x)  +  I^{l/B(x) )  = — 2 — f  where  Sa  is  the  sign  of  B{x)  within 
{a,  6)  near  a  and  Sb  is  the  sign  of  B(x)  within  (a,  h)  near  h. 

The  first  four  properties  follow  immediately  from  the  definition  of  the 
Cauchy  index  (see  §  2).    Property  5.  follows  from  the  fact  that  the  sum  of 

the  indices  I^^B(x)  and  /*  -ppr  is  equal  to  the  difference  rii  —  n2,  where  Wi 

is  the  number  of  times  B{x)  changes  from  negative  to  positive  when  x 
changes  from  a  to  h,  and  ^2  the  number  of  times  B(x)  changes  from  positive 
to  negative. 

We  consider  a  real  polynomiaP° 

/  (2)  =  aoz"  +  a^zn-i  +  a^z""-^  +  •  •  •  +  a„_iZ  +  a„     (a^  >  0) , 

We  can  represent  it  in  the  form 

f(z)=h{z^)  +  zg{z^), 

where 

^  (u)  =  a„  +  a^-2^  -f  . . .,     g{u)  =  a„_^  +  a^_^u  +  •  •  •. 


^^  Here  and  in  what  follows  the  lower  limit  of  the  index  may  be  —  oo  and  the  upper 
limit  may  be  -f  oo . 

50  We  have  here  reverted  to  the  usual  notation  for  the  coefficients  of  a  polynomial. 
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We  shall  use  the  notation 

In  §  3  we  proved  (see  (20)  on  p.  180)  that 

Q  =  n-2k-s,  (72) 

where  k  is  the  number  of  roots  of  f{z)  with  positive  real  parts  and  s  the 
number  of  roots  of  f{z)  on  the  imaginary  axis. 

We  shall  transform  the  expression  (71)  for  q. 

To  begin  with,  we  deal  with  the  case  where  n  is  even.    Let  n  —  2m.    Then 

h  {u)=  ttQu""  +  ae^u""-^  + h  a„,     g(u)  =za^u'^-'^  +  a^W^-^  + h  a„_i . 

9  (^) 
Using  the  properties  1.-4.  and  setting  ?y  =  ±  1  if  lim_y7^.  =  ±  oo ,  respec- 
tively, and  r]  =  0  otherwise,  we  have : 

—  ^^—    A(_  z2)    -  ^  -  ^^—  A  (tt)         ^  -  ^—  A  (w)         ^-«>    h  (U)  "^ 

^— A(w)        -^  h{u)  ' 
Similarly  we  have  for  odd  ri,  7^  =  2m  +  1 : 

h  {u)  =  a^uT  +  OgW"*-!  +  •••  +  «„,  gr  (w)  =  ao^"*  +  ag^"*"!  +  •  •  •  4-  a„_i . 
Setting"  C  =  sign  [|4^]         if  lim  ~~  =  0  and  C  =  0  otherwise,  we  find : 

«f  — «»  zg  ( —  2*)  — *•  0  ^  — ~  Z^  ( —  Z*)         ^  **  tt^  (m)         ^ 

Thus»2 


51  Here  we  mean  by  sign  [5'(w)/7i(w)]„^0-th6  sign  of  g{u)/h{u)  for  negative  values 
of  u  of  sufficiently  small  modulus. 

52  If  oiT*^  0,  then  the  two  formulas  (73')  and  (73")  may  be  combined  into  the  single 
formula 

^-  --A(i*)+   -'-t^(u)-  ^^^    ^ 
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.=/-igJ-/l""ig     (»  =  2«).  (730 

e=^i=4S-^i=rS      (»  =  2«  +  l).  (73") 

As  before,  we  denote  by  Ai,  A2, . . . ,  An  the  Hurwitz  determinants  oif(z). 
We  assume  that  An  7^  0.^^ 

1)  n  =  2m.    By  (70)," 

^--fS^*^-^^^^'  ^1'  ^«'  ••"  "^"-^^^  ^'^^^ 

=  -m  +  2V{l,A^,A^,...,  AJ.  (75) 

But  then,  by  (73'), 

^  =^-2F(l,  A^,  ZI3,  . . .,  4_i)  -2F(1,  ZI2,  -^4.  .  •  •>  ^„) , 
which  in  conjunction  with  Q  =  n  —  2Jt  gives 

k=V{l,  A^,  4,  . . .,  Zl„_i)  +  F(l,  A^,  A^,  . . .,  A,).  (76) 

2)  ti  =  2m  +  l.    By  (70), «       - 

^--^=^  +  l-2^(l'^i'^3,...,^n),  (77) 

=  -m-h2V{l,A^,A^,...,  4_i) .  (78) 

The  equation  Q  =  2m  +  l  —  2k  together  with  (73"),  (77),  and  (78)  again 
gives  (76). 

This  proves  the  Routh-Hurwitz  theorem  (see  p.  194). 


53  In  this  case  s  =  0,  so  that  Q=:n  —  2Tc.  Moreover,  /l„  =7^  0  means  that  the  fractions 
under  the  index  signs  in  (73')  and  (73")  are  reduced. 

54  In  computing  V^,  V^,  . . . ,  V^m  the  values  Oo,  Oi,  . . . ,  dm  and  bo,  &i,  . . . ,  fcm  must  be 
replaced  by  Oo,  02,  . . . ,  02m  and  0,  Oi,  an,  ... ,  a2m-i  respectively  in  computing  the  first  index 
and  by  Oo,  02,  . . . ,  chm  and  Oi,  as,  . . . ,  a2m-i,  0  respectively  in  computing  the  second  index. 

55  In  computing  the  first  index  in  (70)  we  take  Oo,  Oa, . . . ,  flam,  0  and  0,  ai,  Oj, . . . ,  aam+i, 
respectively,  instead  of  Oo,  ai,  . . . ,  am+i  and  bo,  fti,  . . . ,  6m+i;  and  in  computing  the  second 
index  we  take  Oi,  as,  ... ,  a2m4-i  and  ao,  as,  . . . ,  a2m,  respectively,  instead  of  a©,  oi,  . . . ,  am 
and  ?>o,  fti,  . . . ,  &m. 
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2.    Note  1.    If  in  the  formula 

h=V{\,  Ai,  ZI3,  . . .)  +  F(l,  A^,  A^,  . . .) 

some  intermediate  Hurwitz  determinants  are  zero,  then  the  formula  remains 
valid,  only  in  each  group  of  successive  zero  determinants 

the  following  signs  must  be  attributed  to  these  determinants  (in  accordance 
with  Theorem  7) 

sign  Ai+^j  =(-  1)     *     sign  A^     {j  =  l,  2,  .  . .,  p), 
which  yields : 


F(Zl„^,+2.---^i+2p+2) 


^t"    ,       for  odd  p  , 

(79) 

p  +  1  —  e     ^ _^    ^    or,/J    o  _  /        1  X  2„4„„  ^Z+2p+2 


for  even  p  and  e  =(—  1)  ^sign 


A  careful  comparison  of  this  rule  for  computing  k  in  the  presence  of 
vanishing  Hurwitz  determinants  with  the  rule  given  in  Theorem  5  (p.  201) 
shows  that  the  two  rules  coincide.^*' 

Note  2.  If  An  =  0,  then  the  polynomials  ugiu)  and  h{u)  are  not  co- 
prime.  We  denote  hy  d(u)  the  greatest  common  divisor  of  g{u)  and  h{u) 
and  hy  u'l^diu)  that  oiug{u)  and  li{u)  (7  =  0  or  1).  We  denote  the  degree 
of  d{u)  by  d  and  we  set  h{u)  ^d{u)hi(u)  and  g{u)  ^d{u)gi{u). 

The  irreducible  rational  fraction  giiu)/hi{u)  always  corresponds  to  an 
infinite  Hankel  matrix  ^=  If  Si+fc  11 T  of  rank  r,  where  r  is  the  degree  of 
hi{u).  The  corresponding  determinant  Drj^O  and  Dr+i  =  Dr+2  =  -  -  -  =  0. 
By  (68')  V^,  ^  0,  P2r+2  =  ^2r+4  =  •  •  •  =  0.    Morcovcr, 

When  we  apply  all  this  to  the  fractions  under  the  index  sign  in  (74),  (75), 
(77),  and  (78)  we  easily  find  that  for  every  n  (even  or  odd)  and  x  =  2^  +  y 


and  that  the  formulas  (74),  (75),  (77),  and  (78)  all  remain  valid  in  this 
case,  provided  we  omit  all  the  Ai  with  ^  >  n  —  «  on  the  right-hand  sides  and 
replace  the  number  m  (in  (77),  m  +  1)  by  the  degree  of  the  corresponding 


s^J  We  have  to  take  account  here  of  the  remark  made  in  footnote  36  (p.  201). 
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denominator  of  the  fraction  under  the  index,  after  reduction.     We  then 
obtain  by  taking  (73')  and  (73")  into  account: 

Together  with  the  formula  Q  =  n  —  2k  —  s  this  gives : 

k^=V{l,  A^,  A^,...)+V  (1,  A^,  A^, . . .) ,  (80) 

where  ki  =  k  +  s/2  —  x/2  is  the  number  of  all  the  roots  oif{z)  in  the  right 
half -plane,  excluding  those  that  are  also  roots  of  /( — 2;).^^ 


§  13.    Some  Supplements  to  the  Routh-Hurwitz  Theorem. 
Stability  Criterion  of  Lienard  and  Chipart 

1.    Suppose  given  a  polynomial  with  real  coefficients 


/  (z)  =  a^z""  +  a-^z''-'^ 


+  «„     (ao>0). 


Then  the  Routh-Hurwitz  conditions  that  are  necessary  and  sufficient  for 
all  the  roots  oi  f(z)  to  have  negative  real  parts  can  be  written  in  the  form 
of  the  inequalities 


where 


A^>0,A^>0,...,A^>0, 


(81) 


A 


an    ao    a* 


0     «!    ag  . .  . 
0      tto    ^2  ^4 


(%  =  0  for  k>n) 


is  the  Hurwitz  determinant  of  order  i  (t  =  1,  2,  . . . ,  ti). 

If  (81)  is  satisfied,  then  f{z)  can  be  represented  in  the  form  of  a  product 
of  tto  with  factors  of  the  form  z  +  u,  z^  ■\-  vz  +  w  (li  >  0,  i;  >  0,  zi;  >  0),  so 
that  all  the  coefficients  of  f{z)  are  positive:^® 


^^  This  follows  from  the  fact  that  x  is  the  degree  of  the  greatest  common  divisor  of 
h{u)  and  ug(u)  ;  x  is  the  number  of  'special'  roots  of  f(z),  i.e.,  those  roots  z*  for  which 
—  z*  is  also  a  root  of  f(z).  The  number  of  these  special  roots  is  equal  to  the  number  of 
determinants  in  the  last  uninterrupted  sequence  of  vanishing  Hurwitz  determinants 
(including  J„ ) :    An-x+i  =  •  •  •  =  zl„  =  0 . 

*^  Oo  >  0,  by  assumption. 
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ai  >  0,  ^2  >  0,  .  .  .  ,  a„  >  0  .  (82) 

Unlike  (81),  the  conditions  (82)  are  necessary  but  by  no  means  suffi- 
cient for  all  the  roots  of  f{z)  to  lie  in  the  left  half -plane  Re  2:  <  0. 

However,  when  the  conditions  (82)  hold,  then  the  inequalities  (81)  are 
not  independent.  For  example :  For  ti  ==  4  the  Routh-Hurwitz  conditions 
reduce  to  the  single  inequality  ZI3  >  0 ;  for  ti  =  5,  to  the  two  :  ZI2  >  0,  ZI4  >  0 ; 
for  71  =  6  to  the  two  :  ZI3  >  0,  ZI5  >  0.=« 

This  circumstance  was  investigated  by  the  French  mathematicians 
Lienard  and  Chipart®"  in  1914  and  enabled  them  to  set  up  a  stability  criterion 
different  from  the  Routh-Hurwitz  criterion. 

Theorem  11  (Stability  Criterion  of  Lienard  and  Chipart)  :  Necessary 
cmd  sufficient  conditions  for  all  the  roots  of  the  real  polynomial  f(z)  = 

a^  +  ayZ^~'^  + \-  cLn   (<*o  >  0)  to  have  negative  real  parts  can  he  given 

in  any  one  of  the  following  four  forms  :®^ 


1)  a„>0,  a„_2>0,  ...;  A^>0,A^>0,  ..., 

2)  a„>0,  a„_2>0,  ...;  A^>0,A^>0,  ..., 

3)  a„>0;a„_i>0,  a„_3>0,  ...;  A>0,  A^>0,  . 

4)  «„>0;a„_i>0,  a„_3>0,  ...;  Zl2>0,  A^>0,  . 


From  Theorem  11  it  follows  that  Hurwitz's  determinant  inequalities  (81) 
are  not  independent  for  a  real  polynomial  fiz)  =  af^^-\-  a^z^~'^  +  •  •  •  +  a„ 
(ao  >  0)  in  which  all  the  coefficients  (or  even  only  part  of  them:  a„,  a„_2, 
...  or  a«,a„_i,an-3>  •  •  •)  ^^e  positive.  In  fact :  If  the  Hurwitz  determinants 
of  odd  order  are  positive,  then  those  of  even  order  are  also  positive,  and 
vice  versa. 

Lienard  and  Chipart  obtained  the  condition  1)  in  the  paper  [259]  by 
means  of  special  quadratic  forms.  We  shall  give  a  simpler  derivation  of  the 
condition  1)  (and  also  of  2),  3),  4))  based  on  Theorem  10  of  §  11  and  the 
theory  of  Cauchy  indices  and  we  shall  obtain  these  conditions  as  a  special 
case  of  a  much  more  general  theorem  which  we  are  now  about  to  expound. 

"We  again  consider  the  polynomials  h{u)  and  g{u)  that  are  connected 
with  f{z)  by  the  identity 


5^  This  fact  has  been  established  for  the  first  few  values  of  n  in  a  number  of  papers 
on  the  theory  of  governors,  independently  of  the  general  criterion  of  Lienard  and  Chipart, 
with  which  the  authors  of  these  papers  were  obviously  not  acquainted. 

6^  See  [259].  An  account  of  some  of  the  basic  results  of  Lienard  and  Chipart  can  be 
found  in  the  fundamental  survey  by  M.  G.  Krein  and  M.  A.  Naimark  [25]. 

®^  Conditions  1),  2),  3),  and  4)  have  a  decided  advantage  over  Hurwiti'  conditions, 
because  they  involve  only  about  half  the  number  of  determinantal  inequalities. 
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f{z)  =  h{z^)  +  zg{z^). 
If  n  is  even,  n  =  2m,  then 

h  {u)  =  a^u""  +  a^u"^-^  +•••  +  «„,     9  {u)  =  a^u""-^  +  a^u""-^  +  •  •  •  +  a„_i; 

if  n  is  odd,  n  =  2m  +  1,  then 

h  {u)=a^u"'  +  agw"^-!  +  -"  +  a^,  g{u)  =a^u'^  +  ag^"""^  +  •  •  •  +  a„_i  • 

The  conditions  a„  >  0,  an-z  >  0,  ...  (or  a„_i  >  0,  a„_3  >  0,  . . .)  can 
therefore  be  replaced  by  the  more  general  condition:  Ifi^u)  (or  g{u))  does 
not  change  sign  for  u  >  0.®^ 

Under  these  conditions  we  can  deduce  a  formula  for  the  number  of  roots 
oif{z)  in  the  right  half -plane,  using  only  Hurwitz  determinants  of  odd  order 
or  of  even  order. 

Theorem  12 :    If  for  the  real  polynomial 

f  (z)  =  «oZ"  +  a^z^-^  +  . . .  +  a^  n=  ^  (z^)  +  zg{z^)    [a^  >  0) 

h(u)  (or  g{u))  does  not  change  sign  for  ii  >  0  and  the  last  Hurwitz  deter- 
minant An  v^  0,  then  the  number  k  of  roots  of  f{z)  in  the  right  half -plane 
is  determined  hy  the  formulas 


n  =  2m 


n  =  2w  +  1 


does  not 
change 
sign 
for 

w>0 


fc=2F(l,Ji,Zl3..--»^„) 


1— ec 


=  2F(l,zl2,4-..»^n-i)  + 


1  — ec 


(83) 


9(u) 
does  not 
change 
sign 
for 

where^^ 


k  =  2V{hA„A^,...,An-i)  + 

^2V{hA„A„...,A,)-'- 


k  =  2V{l,Ai,A^,...,An) 


1— eo 


27(1,4.^4....,  ^n-l)  + 


2 
l-so 


£oo  =  sign 


(u) 


[h{ 


w)l  _   .       \9{u)] 


(84) 


62  I.e.,  hiu)  ^  0  or  h{u)  ^  0  f or  w  >  0  (5r(w)  ^  0  or  g(u)  ^  0  f or  w  >  0). 

63  If  OiT^O,  then  Coo  =  sign  ai ;  and,  more  generally,  if  ai  =  (h  =  . .  .=^a2fi-i=0, 
a2fi+  \y^  0,  then  Coo  =  sign  a2;i+i.  If  On-i  v^  0,  then £„  =  sign  (U-J On',  and,  more  generally, 
if  a^_j  =  a^_3=..  .  =  an-2/i-i  =0  andan-2/i-i  v^  0,  then  Consign  a«-2/i-i/a„. 
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Proof.    Again  we  use  the  notation 

Corresponding  to  the  table  (83)  we  consider  four  cases: 

1)  n  —  2m;h{u)  does  not  change  sign  for  u  >  0.    Then^^ 

0     h(u)        0      h{u)      ^' 
and  so  the  obvious  equation 

TO       iM^  _ _  7-0       ^^(^) 
°°h{u)~~  °°  h{u) 

implies  that  :®^ 

— ^  h  (u)  — <-   h  (u)   ' 

But  then  we  have  from  (74)  and  (75)  : 

and  therefore  the  Routh-Hurwitz  formula  (76)  gives: 

k=2V{l,  A^,A^, . . .,  Zl„_i)-  2F(1,  A^,  A^,  ...,AJ. 

2)  n  =  2m;  g(u)  does  not  change  sign  for  u  >  0.     In  this  case, 

h{u)       ^,«,  h{u) 


0        n  lii\  ~"     0 


g  (u)         "      -wgr  {u) 


=  0, 


0        M^         ^0  ^(^)   _  /^ 

SO  that  with  the  notation  ( 84 )  we  have  : 

When  we  replace  the  functions  under  the  index  sign  by  their  reciprocals, 
then  we  obtain  by  5.  (see  p.  216)  : 


64  If   hiui)=0    (wi>0),   then   g(ux)^0,   because    A^y^O.      Therefore   h(u)^0 
(u  >  0)  implies  that  g{u)/h{u)  does  not  change  sign  in  passing  through  u  =  Ui. 
«s  From  An  =«»  ^n-i^  0  it  follows  that  ifi(O)  =  On  v^  0. 
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But  this  by  (74)  and  (75)  gives: 

Hence,  in  conjunction  with  the  Routh-Hurwitz  formula  (76),  we  obtain 

6oo  — —  Eq 


k  =  2V{l,Ai,  4, . . .)  +  ^-^  =  2F(l,zJ2,^„  . . .) 


2 

S)  n  =  2m  +  1,  g{u)  does  not  change  sign  for  u  >  0. 
In  this  case,  as  in  the  preceding  one,  (85)  holds.    When  we  substitute 
the  expressions  for  the  indices  from  (77)  and  (78)  into  (85),  we  obtain: 

V{1,A^,A„  . .  .)-V(hA^,A^, . . .)  =  ^  . 

In  conjunction  with  the  Routh-Hurwitz  formula  this  gives : 

k=2V{l,A^,A^,,..)-^-^=2V{l,A^.A^,,..)  +  ^. 

4)   n  =  2m  +  1,  h(u)  does  not  change  sign  for  u  >  0. 
From  the  equations 

°  h{u)         ^     h{u)  h{u)   ^  h  (u) 

j+o.g{u)       r+ooUg{u)_ 
—  ^(i*)"^  —  h{u)  """• 


we  deduce : 


Taking  the  reciprocals  of  the  functions  under  the  index  sign,  we  obtain : 

T+ooU{u)       ^+oo   h{u)    _ 

Again,  when  we  substitute  the  expressions  for  the  indices  from  (77)  and 
(78),  we  have: 

V{l,A^,A^,...)-V{l,A^,A^,...)  =  ^-^, 
From  this  and  the  Routh-Hurwitz  formula  it  follows  that : 

k  =  2V(l,A^,A^,...)-'^-^  =  2V{l,A^,A^,...)+^-^. 

This  completes  the  proof  of  Theorem  12. 

From  Theorem  12  we  obtain  Theorem  11  as  a  special  case. 
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2.   Corollary  to  Theorem  12 :    //  the  real  polynomial 

f  (2)  =  ao2:"  +  «i2;"-i +  •••  +  «„  K>  0) 
has  positive  coefficients 

ao>0,     ai>0,     a^>0,...    ,a„>0, 

and  An  ¥=  0?  i^^'^  i^^  number  k  of  its  roots  in  the  right  half-plane  He  z  >  0 
is  determined  by  the  formula 

k  =  2V{l,A^,A^, . .  .)  =  2V{1,  A^,A^, . . .) . 

Note.  If  in  the  last  formula,  or  in  (83) ,  some  of  the  intermediate  Hurwitz 
determinants  are  zero,  then  in  the  computation  of  V{1,  Ai,  A^,  .  ,  .)  and 
V(l,  Az,  A4,  .  . .)  the  rule  given  in  Note  1  on  p.  219  must  be  followed. 

But  if  An  =  An-i  =  . .  .  =  An-x  +  1  =  0,  An-x  y^  0,  then  we  disregard  the 
determinants  An-^^  +  i,  .  . . ,  An  in  (83)^^  and  determine  from  these  formulas 
the  number  ki  of  the  'non-singular'  roots  oi  f(z)  in  the  right  half -plane, 
provided  only  that  h{u)  y^O  for  u^  0  or  g(u)  =7^  0  for  u  >  0.^^ 

§  14.    Some  Properties  of  Hurwitz  Polynomials.   Stieltjes'  Theorem. 
Representation  of  Hurwitz  Polynomials  by  Continued  Fractions 

1.    Let 

/  (2)  =  V"  +  «i2"-'  +  •••+««  («o  v^O) 

be  a  real  polynomial.    We  represent  it  in  the  form 

f{z)  =  h{z^)  +  zg{z^). 

We  shall  investigate  what  conditions  have  to  be  imposed  on  h{u)  and 
g(u)  in  order  that  f{z)  be  a  Hurwitz  polynomial. 

Setting  k  =  s  =  0  in  (20)  (p.  180),  we  obtain  a  necessary  and  sufficient 
condition  for  f(z)  to  be  a  Hurwitz  polynomial,  in  the  form 

Q  =  n, 
where,  as  in  the  preceding  sections, 

T+00  a^zn-i  —a^zn-S  -] 


QQZn  —  a2Zn-2  -f 


66  See  p.  220. 

67  In  this  case  the  polynomials  hi(u)   and  gi(u)   obtained  from  h{u)   and  g(u)   by 
dividing  them  by  their  greatest  common  divisor  d(u)  satisfy  the  conditions  of  Theorem  12. 
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Let  n  =  2m.    By  (73')  (p.  218),  this  condition  can  be  written  as  follows : 

r.  =  2m^/l-^-/+-^.  (86) 

°°  h  {u)         — "«   h{u)  ^     ' 

Since  the  absolute  value  of  the  index  of  a  rational  fraction  cannot  exceed 
the  degree  of  the  denominator  (in  this  case,  m),  the  equation  (86)  can 
hold  if  and  only  if 

/+~|M=^and/+n?f/^=-m  (87) 

°°h{u)  —°°  h{u)  ^     ' 

hold  simultaneously. 

For  n  =  2m  +  l  the  equation  (73")  gives  (on  account  ot  Q  =  n)  : 

„-j+oo    ^(^)  r+ooh{u) 

—  ug  (u)       ^—  g  (u)  ' 

When  we  replace  the  fractions  under  the  index  signs  by  their  reciprocals 
(see  5.  on  p.  216)  and  observe  that  h{u)  and  g{u)  are  of  the  same  degree  m, 
we  obtain  :^^ 

77-2m-4-l-/+-'^-7+^'^^^4-£  m\ 

n-Zm+\.-l_^^  (w)      ^—  h  {u)  +  ^"^  •  ^^^f 

Starting  again  from  the  fact  that  the  absolute  value  of  the  index  of  a  fraction 
cannot  exceed  the  degree  of  the  denominator  we  conclude  that  (88)  holds 
if  and  only  if 

/+n|j^  =  m,     /l:?^=-mande.=l  (89) 

— °°  h  {u)  '  ^  h{u)  ^      ' 

hold  simultaneously. 

If  n  =  2m,  the  first  of  equations  (87)  indicates  that  h{u)  has  m  distinct 
real  roots  u^  <iU2  <  . . .  <Um  and  that  the  proper  fractions  g{u)/h(u) 
can  be  represented  in  the  form 


where 


^^=fSo>"       <^=^'' ™)-  ^^^ 


From  this  representation  of  g{u)/h{u)  it  follows  that  between  any  two 
roots  Ui,  Ui+i  of  hiu)  there  is  a  real  root  u/  of  g{u)  (i=l,  2,  .  .  . ,  m  —  1) 
and  that  the  highest  coefficients  of  h{u)  and  g{u)  are  of  like  sign,  i.e., 


As  in  the  preceding  section,  Coo  =  sign  \rj^\ 


§  14.  HuRwiTz  Polynomials.    Stieltjes'  Theorem  227 

h  {u)  =  aQ{u  —  Ui)-"{u  —  uj,  g  {u)  =aj^{u  —  u[)  •••(%  —  u^_^) , 

t^l  <  Wi  <  ^2  <  ^2  <  *  •  •  <  ^m-l  <  '^m-1  <  '^m^  «0«1  >  ^  • 

The  second  of  equations  (87)  adds  only  one  condition 

By  this  condition  all  the  roots  of  h{u)   and  g{u)  must  be  negative. 

If  7i  =  2m  +  l,  then  it  follows  from  the  first  of  equations  (89)   that 
h{u)  has  m  distinct  real  roots  Ui  <  i<^2  <  •  •  •  <  ^m  and  that 


where 


^  =  ^-x  +  ^2'„-i^.  (^-.^0),  (91) 


^i=V^)>^  (i  =  l,2,...,m).  (91') 

The  third  of  equations  (89)  implies  that 

5_i>0,  (92) 

i.e.,  that  the  highest  coefficients  Qo  and  ai  are  of  like  sign.  Moreover,  it 
follows  from  (91),  (91'),  and  (92)  that  c/(ii)  has  m  real  roots  u\  <  w'2  <  •  •  • 
<  u'm  in  the  intervals  ( —  00,  t^i),  (tti,  1^2),  •  •  • ,  {Um-i,Um).    In  other  words, 

h  {u)  =  ai{u  —  u^)-"{u  —  uJ,  g  (u)  =  aQ{u  —  u[)  "^{u  —  uJ  , 

u\<u^<U2,<U2<---<u^<u^\  a^a^Xi. 

The  second  of  equations  (89),  as  in  the  case  71  =  2m,  only  adds  one  further 
inequality 

Definition  3.  We  shall  say  that  two  polynomials  h{u)  and  g{u)  of 
degree  m  {or  the  first  of  degree  m  and  the  second  of  degree  m  —  1)  form  a 
positive  pair^^  if  the  roots  Ui,  U2,  . . . ,  Um  and  u[,  u^,  •  .  .,  ^m  (^^  ^i»  ^2'  •  •  •» 
uln-i)  are  all  distinct,  real,  and  negative  and  they  alternate  as  follows : 

u[<Ui<U2<U2<"'<u'^<U^<0 

(or  Ui<u[<u^<'--<  u^_^  <u^<Qi) 
and  their  highest  coefficients  are  of  like  signJ^ 


69  See  [17],  p.  333.  The  definition  of  a  positive  pair  of  polynomials  given  here  differs 
slightly  from  that  given  in  the  book  [17]. 

^°  If  we  omit  the  condition  that  the  roots  be  negative,  we  obtain  a  real  pair  of  poly- 
nomials.   For  the  application  of  this  concept  to  the  Routh-Hurwitz  problem,  see  [36]. 
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When  we  introduce  the  positive  numbers  Vi=^  —  Ui  and  v/  =  —  u/  and 
multiply  h(u)  and  g{u)  by  +  1  or  —  1  so  that  their  highest  coefficients  are 
positive,  then  we  can  write  the  polynomials  of  this  positive  pair  in  the  form 

m  m 

h(u)  =  a^n{u  +  Vi)y      9M  =  (^oII(u  +  Vi),  (93) 

1=1  i=l 

where 

in  case  both  h{u)  and  g{u)  are  of  degree  m,  and  in  the  form 

m  m—1 

h  {u)  =a^n{f^  +  Vi),  g  {u)  =a^niu  +  <) ,  (93') 

1=1  »=i 

where 

ao>0,  ai>0,     0<v^<v^_T^<v^_j_<'"<v[<Vi, 

in  case  h{u)  is  of  degree  m  and  g{u)  of  degree  m  —  1. 

By  our  earlier  arguments  we  have  proved  the  following  two  theorems: 

Theorem  13  :  The  polynomial  f{z)  =^hiz^)  +  zg{z^)  is  a  Hurwitz  poly- 
nomial if  and  only  if  h{u)  and  g{u)  form  a  positive  pairJ'^ 

Theorem  14:  Two  polynomials  h{u)  and  g{u)  the  first  of  which  is  of 
degree  m  and  the  second  of  degree  m  or  m  —  1  form  a  positive  pair  if  and 
only  if  the  equations 

hold  and,  when  h{u)  and  g{u)  are  of  equal  degree,  the  additional  condition 


Soo  —sign 
holds. 


h(u) 


-1  (95) 


2.  Using  properties  of  the  Cauchy  indices  we  can  easily  deduce  from  the 
last  theorem  a  theorem  of  Stieltjes  on  the  representation  of  a  fraction 
g{u)/h{u)  as  a  continued  fraction  of  a  special  type,  provided  h{u)  and 
g{u)  form  a  positive  pair  of  polynomials. 

The  proof  of  Stieltjes'  theorem  will  be  based  on  the  following  lemma: 


^^  This  theorem  is  a  special  case  of  the  so-called  Hermite-Biehler  theorem   (see   [7], 
p.  21). 
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Lemma.    //  the  polynomials  h{u)  and  g{u)   {h(u)  of  degree  m)  form 
a  positive  pair  and 


h{u) 


:=C  + 


du  + 


gxi'U') 


(96) 


where  c,  d  are  constants  and  hi{u),  gi{u)  are  polynomials  of  degree  not 
exceeding  m  —  1,  then 

1.  c^O,  c?>0; 

2.  hi (u),  gi (u)  are  of  degree  m  —  1 ; 

3.  hi{u)  and  gi{u)  form  a  positive  pair. 

Given  h{u)  and  g{u),  the  polynomials  hi{u)  and  gi{u)  are  uniquely 
determined  (to  within  a  common  constant  factor)  and  so  are  c  and  d. 

Conversely,  from  (96)  and  1.,  2.,  3.  it  follows  that  h{u)  and  g{u)  form 
a  positive  pair,  that  h{u)  is  of  degree  m,  and  g{u)  is  of  degree  m  or  m  —  1 
according  as  cy>  0  or  c  =  0. 

Proof.  Let  h{u),  g{u)  be  a  positive  pair.  Then  it  follows  from  (94) 
and  (96)  that: 


m=^r 


h{u)~   —' 


du  + 


9'i(^) 


(97) 


This  equation  implies  that  gi{u)  is  of  degree  m  —  1  and  that  dy^O. 
Further,  from  (97)  we  find: 


'=-^--h+^l+^^^^^ 


■-ii\ 


hAu) 


9i{u) 


+  sign  d 


Hence  it  follows  that  c?  >  0  and  that 


9i{u) 


(m-1). 


(98) 


The  second  of  equations  (94)  now  gives: 


m=I+ 


(u) 


—'^  h{u) 


/+: 


CW  + 


1 


T+oo 


d  + 


ugi{u) 


=  -/+ 


d  +  ^'^l 

d+'^^^. 
ug^  {u 


—  _  T+' 


==— 7 


hi{u) 


—  ug^  {U)  • 


(99) 


Hence  it  follows  that  hi{u)  is  of  degree  m  —  1. 

Condition  (95)  yields,  by  (96)  :  c  >  0.    But  if  g{u)  is  of  smaller  degree 
than  h{u),  then  it  follows  from  (96)  that  c  =  0. 
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(98)  and  (99)  imply: 

where 


£<i>  =  sign 


91  {u) 


(^)J 


«=  +  oo 


Since  the  second  of  the  indices  (100)  is  in  absolute  value  less  than  m  —  1, 
we  have 

e^l^  =  l,  (101) 

and  then  we  conclude  from   (100)   and   (101),  by  Theorem  12,  that  the 
polynomials  hx{u)  and  gi{u)  form  a  positive  pair. 
From  (96)  it  follows  that 


c=lim|j^,      Hm 


tt->  oo 


^  '      c\u 


h  (u)         J  d  ' 


After  c  and  d  have  been  found,  the  ratio  -—-!  is  determined  by  (96). 

The  relations  (97),  (98),  (99),  (100),  and  (101)  applied  in  the  reverse 
order,  establish  the  second  part  of  the  lemma.  Thus  the  proof  of  the  lemma 
is  complete. 

Suppose  given  a  positive  pair  of  polynomials  h(u),  g(u),  with  h(u) 
of  degree  m.  Then  when  we  divide  g(u)  hj  h{u)  and  denote  the  quotient 
by  Co  and  the  remainder  by  gi{u),  we  obtain : 

9  (u)       ^     ,   9i  (u)       ^     ,       1 


^0+  },  („\  —  ^0  + 


h{u)~~"^  ^  h (u)  ~   0  '    h{u)  ' 
9i{u) 

— 7^  can  be  represented  in  the  form  doU  +  *V-t  ,  where  hi(ii),  like  gi(u), 
is  of  degree  less  than  m.    Hence 


9{u) 


=  ^0  + O^.  (102) 


h{u)        **   '  ,  K  (w)  • 


doU  +  — 


^i(^) 
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Thus,  the  representation  (96)  always  holds  for  a  positive  pair  h{u)  and 
g(u).    By  the  lemma 

Co  ^0,  do>  0, 

and  the  polynomials  hi(u)  and  gi(u)  are  of  degree  m  —  1  and  form  a  posi- 
tive pair. 

When  we  apply  the  same  arguments  to  the  positive  pair  hi{u),  gi{u), 
we  obtain 

f!(!^=Ci+ 1-^,  (102') 

92  (w)         tl 

where 

ci  >  0,  di  >  0, 

and  the  polynomials  h2{u)  £  nd  c/2l>)  are  of  degree  m  —  2  and  form  a  posi- 
tive pair.  Continuing  the  i>rocess,  we  finally  end  up  with  a  positive  pair 
hm  and  Qm,  where  hm  and  Qm  are  constants  of  like  sign.    We  set : 

|™  =  c„.  (102to)) 

Then  it  follows  from  (102),  (102'),  . . . ,  (102('"))  that: 

A  (I*)  -  ^0  +  1 

^0^  + 1 

ci  + j- 


+ 


<^m-lW  +  — 


Using  the  second  part  of  the  lemma,  we  show  similarly  that  for  arbitrary 
Co  ^  0,  Ci  >  0,  .  .  .  ,Cm  >  0,  do>  0,  cZi  >  0,  .  .  .  ,  dm-i  >  0  the  above  con- 
tinued fraction  determines  uniquely  (to  within  a  common  constant  factor)  a 
positive  pair  of  polynomials  h(u)  and  g{u),  where  h{u)  is  of  degree  m  and 
g{u)  is  of  degree  m  when  Co  >  0  and  of  degree  m  —  1  when  Cq  =  0. 

Thus  we  have  proved  the  following  theorem. ^^ 


'2  A  proof  of  Stieltjes '  theorem  that  is  not  based  on  the  theory  of  Cauchy  indices  can 
be  found  in  the  book  [17],  pp.  333-37. 
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Theorem  15  (Stieltjes)  :    If  h(u),  g(u)  is  a  positive  pair  of  polynomials 
and  h(u)  is  of  degree  m,  then 


9iu) 


=  Cn  + 


A  (w)  —  ^0  -r  1 


'^O'^-r  J 


ci  + r- 


(103) 


+ '- 


where 

Co  ^  0,  ci>  0,  . . .,  c^>  0,     d^>0,., .,  6^^_i>  0  . 

Here  Co  =  0  if  g{u)  is  of  degree  m  —  1  and  Co^  0  if  g{u)  is  of  degree  m. 
The  constants  ci,  die  are  uniquely  determined  hy  h{u),  g{u). 

Conversely,  for  arbitrary  Co  ^  0  and  arbitrary  positive  Ci,  .  .  . ,  Cm, 
do,  . . . ,  dm-i,  the  continued  fraction  (103)  determines  a  positive  pair  of 
polynomials  h(u),  g{u),  where  h{u)  is  of  degree  m. 

From  Theorem  13  and  Stieltjes'  Theorem  we  deduce: 

Theorem  16 :  A  real  polynomial  of  degree  n  f{z)  ^h{z^)  +  zg(^z^)  is  a 
Hurwitz  polynomial  if  and  only  if  the  formula  (103)  holds  with  non- 
negative  Co  and  positive  ci, . . . ,  Cm,  do, . . . ,  dm-i-  Here  Co  >  0  when  n  is  odd 
and  Co  =  0  when  n  is  even. 


§  15.     Domain  of  Stability.     Markov  Parameters 

1.  With  every  real  polynomial  of  degree  n  we  can  associate  a  point  of  an 
w-dimensional  space  whose  coordinates  are  the  quotients  of  the  coefficients 
divided  by  the  highest  coefficient.  In  this  '  coefficient  space '  all  the  Hurwitz 
polynomials  form  a  certain  w-dimensional  domain  which  is  determined^^  by 
the  Hurwitz  inequalities  zli  >  0,  ZI2  >  0,  .  .  .  ,  Zl„  >  0,  or,  for  example,  by 
the  Lienard-Chipart  inequalities  an  >  0,  an-2  >  0,  .  .  .  ,  Zli  >  0,  zig  >  0,  .  .  .  . 
We  shall  call  it  the  domain  of  stability.  If  the  coefficients  are  given  as 
functions  of  p  parameters,  then  the  domain  of  stability  is  constructed  in  the 
space  of  these  parameters. 


"For  oo  =  l. 
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The  study  of  the  domain  of  stability  is  of  great  practical  interest;  for 
example,  it  is  essential  in  the  design  of  new  systems  of  governors.^'' 

In  §  17  we  shall  show  that  two  remarkable  theorems  which  were  found 
by  Markov  and  Chebyshev  in  connection  with  the  expansion  of  continued 
fractions  in  power  series  with  negative  powers  of  the  argument  are  closely 
connected  with  the  investigation  of  the  domain  of  stability.  In  formulating 
and  proving  these  theorems  it  is  convenient  to  give  the  polynomial  not  by 
its  coefficients,  but  by  special  parameters,  which  we  shall  call  Markov 
parameters. 

Suppose  that 

/  (z)  =  a^z""  +  aiz«-i  +  •••  +  «„         (tto  ^0) 
is  a  real  polynomial.    We  represent  it  in  the  form 

We  may  assume  that  /t(w)  and  gr(ifc)  are  co-prime  (Zl„  :7^  0) .  We  expand 
the  irreducible  rational  fraction  Y(\  ^^  ^  series  of  decreasing  powers  oiw?^ 

h  (u)  -  *-i  +  u       2*2  i-  ^3      ^4  +       •  •     U"*; 

The  sequence  So,  Si,  ^2,  ...  determines  an  infinite  Hankel  matrix 
iSf  =  II  Si+fc  II  *.    We  define  a  rational  function  B{v)  by 

Then 

iJ(„)  =  _s_i  +  A  +  A  +  ^  +  ...,  (106) 

so  that  we  have  the  relation  (see  p.  208) 

B(v)r^8.  (107) 

Hence  it  follows  that  the  matrix  S  is  of  rank  m  =  [n/2],  since  m,  being 
the  degree  of  h{u),  is  equal  to  the  number  of  poles  of  R{v).'^ 

For  n  =  2m  (in  this  case,  s_i  =  0),  the  matrix  S  determines  the  irre- 
ducible fraction  ?4^  uniquely  and  therefore  determines  f(z)  to  within  a 
h{u)  ^       "" 

■^^  A  number  of  papers  by  Y.  I.  Naimark  deal  with  the  investigation  of  the  domain  of 
stability  and  also  of  the  domains  corresponding  to  various  values  of  fe  (fc  is  the  number  of 
roots  in  the  right  half -plane).     (See  the  monograph  [41].) 

^5  In  what  follows  it  is  convenient  to  denote  the  coefficients  of  the  even  negative  powers 
of  w  by  —  si,  —  S3,  etc. 

76  See  Theorem  8  (p.  207). 
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constant  factor.  For  n  =  2m-\-l,  in  order  to  give  f{z)  by  means  of  S  it  is 
necessary  also  to  know  the  coefficient  s_i. 

On  the  other  hand,  in  order  to  give  the  infinite  Hankel  matrix  8  of  rank 
m  it  is  sufficient  to  know  the  first  2m  numbers  So,  Si,  .  .  . ,  S2m-i.  These 
numbers  may  be  chosen  arbitrarily  subject  to  only  one  restriction 

D^  =  \Si+k\7^0  ;  (108) 

all  the  subsequent  coefficients  S2m,  S2m+i,  ...  of  (104)  are  uniquely  (and 
rationally)  expressible  in  terms  of  the  first  2m :  Sq,  Si,  .  .  . ,  52m- 1-  For  in  the 
infinite  Hankel  matrix  ^S'  of  rank  m  the  elements  are  connected  by  a  recur- 
rence relation  (see  Theorem  7  on  p.  205) 

m 

s^  =  2J  <^^q-g       (g  =  m,  m  +  1,  . . .).  (109) 

9=1 

If  the  numbers  Sq,  Si,  .  . . ,  Sm-i  satisfy  (108),  then  the  coefficients  ai,  a2, .  .  . , 
am  in  ( 109 )  are  uniquely  determined  by  the  first  m  relations ;  the  subsequent 
relations  then  determine  S2m,  S2m+i,  -  -  -  • 

Thus,  a  real  polynomial  f{z)  of  degree  n  =  2m  with  An  y^O  can  be  given 
uniquely^^  by  2m  numbers  So,  Si,  ...,  S2m-i  satisfying  (108).  When 
n  =  2m  +  1,  we  have  to  add  S-i  to  these  numbers. 

We  shall  call  the  n  values  ^o,  Si,  . . . ,  S2m-i  (for  n  =  2m)  or  s-i,  Sq,  . . . , 
S2m-i  (for  n  =  2mH-l)  the  Markov  parameters  of  the  polynomial  f{z). 
These  parameters  may  be  regarded  as  the  coordinates  in  an  ?i-dimensional 
space  of  a  point  that  represents  the  given  polynomial  fiz). 

We  shall  find  out  what  conditions  must  be  imposed  on  the  Markov  para- 
meters in  order  that  the  corresponding  polynomial  be  a  Hurwitz  polynomial. 
In  this  way  we  shall  determine  the  domain  of  stability  in  the  space  of  Markov 
parameters. 

A  Hurwitz  polynomial  is  characterized  by  the  conditions  (94)  and  the 
additional  condition  (95)  for  n  =  2m-\- 1.  Introducing  the  function  B(v) 
(see  (105) ),  we  write  (94)  as  follows: 

ItZR(v)^m,      ItZvR{v)=m.  (110) 

The  additional  condition  (95)  for  w  =  2m  +  1  gives: 

5_i  >  0. 

Apart  from  the  matrix  S=  ||  Si^u  ||  *  we  introduce  the  infinite  Hankel 
matrix  S^^^  =  \\  Si+^+i  11 «.    Then,  since  by  (106) 


'''^  To  within  a  constant  factor. 
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vR{v)=-s_,v  +  So  +  ^  +  ^  +  '-'  , 

the  following  relation  holds: 

vB{v)  -  S^'K  (111) 

The  matrix  S^^\  like  ^S',  is  of  finite  rank  m,  since  the  function  vR{v),  like 
R(v),has  m  poles.    Therefore  the  forms 

m—l  m—1 

i,  k=0  i,  k=0 

are  of  rank  m.  But  by  Theorem  9  (p.  190)  the  signatures  of  these  forms, 
in  virtue  of  (107)  and  (111),  are  equal  to  the  indices  (110)  and  hence  also 
to  m.  Thus,  the  conditions  (110)  mean  that  the  quadratic  forms  Sm{x,  x)  and 
SlJl^x,  x)  are  positive  definite.    Hence: 

Theorem  17 :    A  real  polynomial  f(z)  =h(z^)  +  zg(z^)  of  degree  n  =  2m 
or  n  =  2m  +  lis  a  Hurwitz  polynomial  if  and  only  if  :^* 

1.  The  quadratic  forms 

m—l  m—l 

^m  {^>   ^)  =  2J    Si^k^iXj^ ,        Sll,\x,  X)  =  2J    «z+A+l«i^*  (112) 

t,  k=0  t,  k=0 

are  positive  definite;  amd 

2.  {For  n  =  2m+l) 

5_i>0.  (113) 

Rere  s_i,  ^o,  Si,  . . . ,  S2m-i  c^^e  the  coefficients  of  the  expansion 


h{u) 


—  ^-1-^     U  W2    "^    U»  U*    "^ 


7*  We  do  not  mention  the  inequality  Zl^v^  0  expressly,  because  it  follows  automatically 
from  the  conditions  of  the  theorem.  For  it  f{z)  is  a  Hurwitz  polynomial,  then  it  is  known 
that  zl„  7^  0.  But  if  the  conditions  1.,  2.  are  given,  then  the  fact  that  the  form  8\};}  (x,  x) 
is  positive  definite  implies  that 

j+00Ug{U)  ,-+00  T>    /      X    

—  /        -T-rv  —  I        vB(v)=  niy 

—  ooh(u)  —  oo  ^    '  ' 

and  from  this  it  follows  that  the  fraction  ug(u)/h(u)  is  reduced,  which  can  be  expressed 
by  the  inequality  zl„  ^  0. 

In  exactly  the  same  way,  it  follows  automatically  from  the  conditions  of  the  theorem 
that  Dm  =  I  Si+fc  |^~^  7^  0,  i.e.,  that  the  numbers  So,  Si,  . . . ,  sam-i,  and  (for  n  =  2m  +  1) 
5-1  are  the, Markov  parameters  of  f(z). 
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We  introduce  a  notation  for  the  determinants 

D,  =  I  «.■+.  ir'.      I>V  =  I  ^.■+*+i  ir'      (P  =1,  2,  . . .,  m).        (114) 
Then  condition  1.  is  equivalent  to  the  system  of  determinantal  inequalities 


X»,  =s„>0,  Z»,= 


i)i^>=.i>0,i)«= 


5l  52 


*1  ^2 


^2  ^3 


>o, ...,  i)^ 


>o,...,d: 


>o, 


>0. 


(115) 


If  w  =  2m,  the  inequalities  (115)  determine  the  domain  of  stability  in 
the  space  of  Markov  parameters.  If  n  =  2m  +  1,  we  have  to  add  the  further 
inequality : 

5_i>0.  (116) 

In  the  next  section  we  shall  find  out  what  properties  of  S  follow  from 
the  inequalities  (115)  and,  in  so  doing,  shall  single  out  the  special  class  of 
infinite  Hankel  matrices  S  that  correspond  to  Hurwitz  polynomials. 

§  16.    Connection  with  the  Problem  of  Moments 

1.    We  begin  by  stating  the  following  problem : 

Problem  of  Moments  for  the  Positive  Axis  0  <  t;  <  oo  :^^  Given  a 
sequence  Sq,  Si,  ...  of  real  numbers,  it  is  required  to  determine  positive 
numbers 

such  that  the  following  equations  hold: 


7=1 


(?>  =  0,1,2,...) 


(118) 


It  is  not  difficult  to  see  that  the  system  (118)  of  equations  is  equivalent 
to  the  following  expansion  in  a  series  of  negative  powers  of  u : 


^^  This  problem  of  moments  ought  to  be  called  discrete  in  contrast  to  the  general  expo- 

m 
nential  problem  of  moments,  in  which  the  sums   Z  fi^v'':  are  replaced  by  Stieltjes  integrals 

fvPdfi{v)    (see  [55]). 
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m 


In  this  case  the  infinite  Hankel  matrix  ^=11  Si+fc  ||  *  is  of  finite  rank  m 
and  by  (117)  in  the  irreducible  proper  fraction 


10- 


iM—  y-B—  (120) 

h  (U)  ^i    U  +  Vj 

(we  choose  the  highest  coefficients  of  h{u)  and  giu)  to  be  positive)  the 
polynomials  h{u)  and  g{u)  form  a  positive  pair  (see  (91)  and  (91')). 

Therefore  (see  Theorem  14),  our  problem  of  moments  has  a  solution  if 
and  only  if  the  sequence  Sq,  Si,  $2,  ...  determines  by  means  of  (119)  and 
(120)  a  Hurwitz  polynomial  f(z)  =h{z^)  +  zg{z^)  of  degree  2m. 

The  solution  of  the  problem  of  moments  is  unique,  because  the  positive 
numbers  Vj  and  fij  {j  =  l,2, . . .  ,m)  are  uniquely  determined  from  the  expan- 
sion (119). 

Apart  from  the  'infinite'  problem  of  moments  (118)  we  also  consider 
the  'finite'  problem  of  moments  given  by  the  first  2m  equations  (118) 

tn 

^P=2!/^i^J        {p=0,l,...,2m-l).  (121) 

7=1 

These  relations  already  determine  the  following  expressions  for  the  Hankel 
quadratic  forms : 


f»— 1  m 

m-1 


i,tr=0  j=l 

w— 1  m 

2J  «i+i+i««a:j  =  2J n-Vj (xa  +  XjVj  +■■■  +  a;„_i«"" ')". 


(122) 


i,k=0  j=l 

Since  the  linear  forms  in  the  variables  iCo,  Xi,  . . . ,  Xm-i 

Xq  +  x^Vj  +  •  •  •  +  x^_^v'~^  (/=  1, 2,  . . . ,  m) 


are  independent  (their  coefficients  form  a  non-vanishing  Vandermonde 
determinant),  the  quadratic  forms  (122)  are  positive  definite.  But  then 
by  Theorem  17  the  numbers  So,  Si,  . . . ,  ^2^-1  are  the  Markov  parameters  of 
a  certain  Hurwitz  polynomial  f{z).  They  are  the  first  2m  coefficients  of 
the  expansion  (119).  Together  with  the  remaining  coefficients  52m,  S2m+i,  •  •  • 
they  determine  the  infinite  solvable  problem  of  moments  (118),  which  has 
the  same  solution  as  the  finite  problem  ( 121 ) . 
Thus  we  have  proved  the  following  theorem: 
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Theorem  18  :    1)  The  finite  problem  of  moments 

m 

s^  =  i:fi/j  (123) 

7=1 

(p  =  0, 1, .  . . ,  2m  —  1 ;  //I  >  0, . .  . ,  //„»  >  0 ;  0  <  i^i  <  ^2  <  . . .  <  Vn.) ,  wkere 

Sp  are  given  real  numbers  and  Vj  and  fij  are  unknown  real  numbers 
(p  =  0,  1,  . . . ,  2m  —  1 ;  i  =  1,  2,  . . . ,  m)  has  a  solution  if  and  only  if  the 
quadratic  forms 

m-l  m-1 

2J  Si+j^XiX,,,      2J  Si+k+i^i^k  (124) 


i,k=0 


i,k=0 


are  positive  definite,  i.e.,  if  the  numbers  So,  Si,  . . . ,  S2m-i  c^^e  the  Markov 
parameters  of  some  Hurwitz  polynomial  of  degree  2m. 
2)  The  infinite  problem  of  moments 


Sp—2J  iJ'/j 


(125) 


7=1 


(p  =  0, 1,  2, ...  ; //I  >  0,.  ..,//„»>  0;0  <  t^i  <  1^2  <  ...  <  Vm),  where  Spare 
given  real  numbers  and  Vj  and  fij  are  unknown  real  numbers  (p  =  0,  1,  . . .  ; 
^'  =  1,  2,  . . . ,  m)  has  a  solution  if  and  only  if  1.  the  quadratic  forms  (124) 
are  positive  definite  and  2.  the  infinite  Hankel  matrix  S=\\  Si+k  11  *  is  of 
rank  m,  i.e.,  if  the  series 


_9{u) 
~~h{u) 


(126) 


determines  a  Hurwitz  polynomial  f(z)=h(z^)=zg{z^)  of  degree  2m. 

3)   The  solution  of  the  problem  of  moments,  both  the  finite  (123)  and 
the  infinite  (124)  problem,  is  always  unique. 

2.  We  shall  use  this  theorem  in  investigating  the  minors  of  an  infinite 
Hankel  matrix  ;8^=  ||  Si.^jc  11*  of  rank  m  corresponding  to  some  Hurwitz 
polynomial,  i.e.,  one  for  which  the  quadratic  form  (124)  is  positive  definite. 
In  this  case  the  generating  numbers  So,  Si,  S2,  . . .  oi  8  can  be  represented  in 
the  form  ( 123 ) ,  so  that  for  an  arbitrary  minor  of  S  of  order  h'^mwe  have : 


Si^+k^  '•'  Si^+kh 


%+*i  *  •  •  ^ih+kh 


i"l^l       i"24'    •  •  •    f^nA 


/"m^m 


ki 


Jch 


.*A 
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and  therefore 
y^i   «^2    •  •  •    «^i 


1  ^ «!  <««<••  ■  <<xh^m 


/^a,/*a. 


But  from  the  inequalities 

0  <  ^1  <  ^2  <  •  •  •  <  V^ 


iW«A 


h  <  *2  <  •  •  •  <  h>     K<K<"'<h 


••A 

ki      k. 

:< 

*,       kt 
Vah   Vah    . 

•v% 

(127) 


it  follows  that  the  generalized  Vandermonde  determinants^" 


^aA 


^«A 


>o, 


''«; 

«?. 

kh 
...    < 

< 

«^ 

...    Va. 

4 

4 

.  .  .    «^aA 

>0 


are  positive. 

Since  the  numbers  Hj  are  positive  (i  =  1,  2, 
from  (127)  that 

I u     L    . . .   L\  I        h  <  io<'"<  H 

S{   ^      ^  >0      0^    ^        ^  ^     h  =  l,2,... 

\k^    k^    ...  kj  \       ki<k2<"'< kj^, 


m),  it  therefore  follows 


m 


(128) 

Conversely,  if  in  an  infinite  Hankel  matrix  ^=11  Si+jc  \\  *  of  rank  m  all 
the  minors  of  every  order  Ji^m  are  positive,  then  the  quadratic  forms  (124) 
are  positive  definite. 

Definition  4:  An  infinite  matrix  ^=11  ^ifc  11"  will  he  called  totally 
positive  of  rank  m  if  and  only  if  all  the  minors  of  A  of  order  h-^m  are 
positive  and  all  the  minors  of  order  h  '^  m  are  zero. 

The  property  of  S  that  we  have  found  can  now  be  expressed  in  the  fol- 
lowing theorem  :®^ 

Theorem  19:  An  infinite  Hankel  matrix  S=  11  Si+h  IL*  is  totally  posi- 
tive of  rank  m  if  and  only  if  1)  S  is  of  rank  m  and  2)  the  quadratic  forms 


m-l 

^   ^i^-k^i^k 
t,ife=0 


m-l 

t,Ar=0 


are  positive  definite. 


80  See  p.  99,  Example  1. 
"See  [173]. 
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From  this  theorem  and  Theorem  17  we  obtain : 

Theorem  20:  A  real  polynomial  f{z)  of  degree  n  is  a  Eur  wit  z  poly- 
nomial if  and  only  if  the  corresponding  infinite  Hankel  matrix  S=  11  Si+k  11" 
is  totally  positive  of  degree  m=  [n/2]  and  if,  in  addition,  5_i  >  0  when 
n  is  odd. 

Here  the  elements  So,  Si,  S2,  . . .  of  S  and  s_i  are  determined  hy  the 
expansion 

'M  =  «-i  +  ^--\+'-\--,  (129) 

where 

f{z)  =  h{z^)  +  zg{z^). 

§  17.    Theorems  of  Markov  and  Chebyshev 

1.  In  a  notable  memoir  'On  functions  obtained  by  converting  series  into 
continued  fractions '^^  Markov  proved  two  theorems,  the  second  of  which  had 
been  established  in  1892  by  Chebyshev  by  other  methods,  and  not  in  the 
same  generality.  ®^ 

In  this  section  we  shall  show  that  these  theorems  have  an  immediate  bear- 
ing on  the  study  of  the  domain  of  stability  in  the  Markov  parameters  and 
shall  give  a  comparatively  simple  proof  (without  reference  to  continued 
fractions)  which  is  based  on  Theorem  19  of  the  preceding  section. 

In  proceeding  to  state  the  first  theorem,  we  quote  the  corresponding 
passage  from  the  above-mentioned  memoir  of  Markov  :^* 

On  the  basis  of  what  has  preceded  it  is  not  difficult  to  prove 
two  remarkable  theorems  with  which  we  conclude  our  paper. 
One  is  concerned  with  the  determinants^® 

and  the  other  with  the  roots  of  the  equation^® 


82  Zap.  Petersburg  Akad.  Nauk,  Petersburg,  1894  [in  Eussian] ;  see  also  [38], 
pp.  78-105. 

83  This  theorem  was  first  published  in  Chebyshev 's  paper  'On  the  expansion  in  con- 
tinued fractions  of  series  in  descending  powers  of  the  variable'  [in  Russian].  See  [8], 
pp.  307-62. 

84  [38],  p.  95,  beginning  with  line  3  from  below. 

85  In  our  notation,  Di,  D2,  .  . . ,  Dm,  B^^^ ,  D^^\  .  .  . ,  D^^.     (See  p.  236.) 

86  In  our  notation,  7i( —  x)  =z  0. 
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^0»  ^l>  *2 »  •  •  •  »  ^2m-2 »  *2tn-l 
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two  sets  of  values 

1.  SQ=aQ,Si  =ai,S2  =a,2, 

2.  5o  =  6o>  ^l  — ^l»  «2=^2» 

for  which  all  the  determinants 


^2m-2  —  ^2m-2»  *2m-l  — ^2m-l> 
*2m-2  ~  ^2m-2  >  *27»-l  "  ^2m-l 


A^  =  Sq,A^  = 


Sq     5i 
*1     *2 


»   •••»^m  = 


J<l)  =  5i,J(2)  = 


^2     ^3 


5o  Si   .  .  .   5^_i 

*1  ^2    •  •  •  ^m 

^m-1  ^m  *2m-2 

51  ^2   •  •  •   ^ni 

52  S3    •  .  .   5^^i 


*m     ^m+1       *2m-l 

#i^r7i  oi(.#  #0  he  positive  numbers  satisfying  the  inequalities 

a©  ^  ^0 »  ^1  ^  ^1'  ^2  =  ^2  »  ^3  ^  «3 »    •  •  •  '  ^2m-2  =  hm-2  »  hm-1  ^  «2m-l  > 

#/iew  o^r  determinant 

mws#  &e  positive  for  all  values 

^0 »  ^1 »  *2 »    •  •  • »  ^2»n-l 

satisfying  the  inequalities 

fto  ^  *o  ^  ^0 »  ^1  ^  *i  ^  «i »  ^2  ^  ^2  ^  ^2 »  •  •  •  > 

«2m-2  ^  ^2m-2  ^  ^2m-2 »   ^27n-l  ^  ^2ot-1  ^  ^2m-l ' 


Under  these  conditions  we  have 


a-i      ...      tt: 


k-l 


tto     ...      <*! 


^*-l 


Oj.  ...  a, 


2i-2 


«*-! 


S2   ■  •  '  Sjg 


^k-1       ^k 


•2*-2 


61 


h 


■h-i 


h-i      h  ■ 


.  b. 


24-2 


and 
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61    62 


...  b 


k+l 


Si     S2 
^2     ^3 


^k     ^k+1   •  •  •    ^2k-l  ^k    ^k+1     '  •  ■   ^2k-l  ^k     ^k+1    •     •   ^ 


...  a. 


^2 


''k+l 


^2k-l 


for  k  =  \,2,  . 


m. 


In  order  to  give  another  statement  of  this  theorem  in  connection  with 
the  problem  of  stability,  we  introduce  some  concepts  and  notations. 

The  Markov  parameters  So,  Si,  . . . ,  S2m-i  (for  n  =  2m)  or  s_i,  So,  Si,  . . . , 
S2m-i  (for  n  =  2m  +  1)  will  be  regarded  as  the  coordinates  of  some  point  P 
in  an  ^-dimensional  space.  The  domain  of  stability  in  this  space  will  be 
denoted  by  G.  The  domain  G  is  characterized  by  the  inequalities  (115)  and 
(116)   (p.  236). 

We  shall  say  that  a  point  P^{Si}  'precedes'  a  point  P*  =  {Si*}  and 
shall  write  P  <  P*  if 


^0  —  ^0 '  ^1  =  ^1'  ^2  =  *2  '  ^3  =  ^3  '   • 

and  (for  n  =  2m  +  1) 

5-1  ^«-l 


J  *27K-1  ^  *2m-l 


(130) 


and  the  sign  <  holds  in  at  least  one  of  these  relations. 

If  only  the  relations  (130)  hold,  without  the  last  clause,  then  we  shall 
write : . 

We  shall  say  that  a  point  Q  lies  'between'  P  and  R  if  P  ^  Q  "<  ^• 

To  every  point  P  there  corresponds  an  infinite  Hankel  matrix  of  rank 
m:  ^=  II  Si^-k  |L*.    We  shall  denote  this  matrix  by  S^. 

Now  we  can  state  Markov's  theorem  in  the  following  way : 

Theorem  21  (Markov)  :  If  two  points  P  and  R  belong  to  the  domain  of 
stability  G  and  if  P  precedes  R,  then  every  point  Q  between  P  and  R  also 
belongs  to  G,  i.e., 

from  P,ReG,P^   Q  :<   R  it  follows  that  Q  e  G. 


Proof.    From  P  r<  Q  ^  R  it  follows  that  P  and  Q  can  be  connected 
by  an  arc  of  a  curve 

5i=(— l)>,.(0[a^^^}/;*  =  0,l,...,2m  — land(for7i=2m+l)i  =  — 1]    (131) 
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passing  through  Q  such  that:  1)  the  functions  cpi{t)  are  continuous,  mono- 
tonic  increasing,  and  dif f erentiable  when  t  varies  from  t  =  a  to  t  =  y',  and 
2)  the  values  a,  P,  y  (a  <C  P  <  y)  oi  t  correspond  to  the  points  P,  Q,  R  on 
the  curve. 

From  the  values  (131)  we  form  the  infinite  Hankel  matrix  S  =  8{t)  = 
II  Si+ic(t)  II  *  of  rank  m.  We  consider  part  of  this  matrix,  namely  the  rec- 
tangular matrix 


>m-l 


>jn+l 


>m-l 


^2m-2    ^2m-l 


(132) 


By  the  conditions  of  the  theorem,  the  matrix  S(t)  is  totally  positive  of 
rank  m  for  t  =  a  and  f  =  y,  so  that  all  the  minors  of  (132)  of  order  p  =  1,  2, 
3,  . . . ,  m  are  positive. 

We  shall  now  show  that  this  property  also  holds  for  every  intermediate 
value  of  ^  (a  <  #  <  y). 

For  p  =  1,  this  is  obvious.  Let  us  prove  the  statement  for  the  minors 
of  order  p,  on  the  assumption  that  it  is  true  for  those  of  order  p  —  1.  We 
consider  an  arbitrary  minor  of  order  p  formed  from  successive  rows  and 
columns  of  (132)  : 


D?'  = 


*<Z+1 


Sq+2    ' 


Sq+p 


'ff+p-1    ^q+p    '  •  •  ^q+2p-2 


[q  =  0.l. 


2{m  —  p)  +  ll 


(133) 


We  compute  the  derivative  of  this  minor 


±D^f=   "^l^f  ^^' 


dt 


i,JZo  ^^q+i+k 


q+i+k 

dt      ' 


dD^^^ 


^q+i+k 


(134) 


(i,  k  =  0,  1,  . . . ,  p  —  1)  are  the  algebraic  complements  (adjoints) 


of  the  elements  of  ( 133 ) .    Since  by  assumption  all  the  minors  of  this  determi- 
nant are  positive,  we  have 


^^q+i+k 

On  the  other  hand,  we  find  from  (131) : 


{i,  h=^0,  1,  . 


1). 


(135) 
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(_l).+i+A.^^_^^™.^0         ii,k=0,l,...    ,2,-1).  (136) 
From  (134),  (135),  and  (136)  it  follows  that 


(-l)*^Df^O         {p=l,2,...    ,m,  ).  (137) 


2(m-2?)  +  l,^ 


Thus,  when  the  argument  increases  from  t  =  a,tot^=y,  then  every  minor 
(133)  with  even  ^  is  a  monotone  non-decreasing  function  and  with  odd  q 
is  a  monotone  non-increasing  function ;  but  since  the  minor  is  positive  for 
t  =  a  and  t  =  y,  it  is  also  positive  for  every  intermediate  value  of  t 
{a<t<y). 

From  the  fact  that  the  minors  of  (132)  of  order  p  —  1  and  those  of  order 
p  that  are  formed  from  successive  rows  and  columns  are  positive,  it  now 
follows  that  all  the  minors  of  (131)  of  order  p  are  positive.®^ 

What  we  have  proved  implies  that  for  every  t  {a  ^  t  ^  y)  the  values 
So,  Si,  . . . ,  S2m-i  and  (for  7i  =  2m  +  1)  s_i  satisfy  the  inequalities  (115) 
and  (116),  i.e.,  that  for  every  t  these  values  are  the  Markov  parameters  of 
a  certain  Hurwitz  polynomial.  In  other  words,  the  whole  arc  (131)  and, 
in  particular,  the  point  Q  lies  in  the  domain  of  stability  G. 

This  completes  the  proof  of  Markov's  Theorem. 

Note.  Since  we  have  proved  that  every  point  of  the  arc  (131)  belongs 
to  G,  the  values  of  (131)  for  every  t  (a  ^  #  ^  y)  determine  a  totally  posi- 
tive matrix  8(t)  =  ||  Si+jc{t)  ||  *  of  rank  m.  Therefore  the  inequalities  (135) 
and  consequently  (137)  as  well  hold  for  every  t  (a^t  ^y),  i.e.,  with  in- 
creasing t  every  D^^^  increases  for  even  q  and  decreases  for  odd  q  (q  =  0,  1, 
2,  . . . ,  2 (m  —  p)  +  1 ;  p  =  1,  . . . ,  m) .  In  other  words,  from  P  ^  Q  i<  R 
it  follows  that 

(_  iyD'^\sp)  ^  (-  lyn^fiZi)  ^  (-  iyD<fm 

(^  =  0,  1,  ...,  2{m  —  p)  +  l;  p=l,  ...,  m). 

These  inequalities  for  q  =  0,  1  give  Markov's  inequalities  (pp.  241). 

We  now  come  to  the  Chebyshev-Markov  theorem  mentioned  at  the  be- 
ginning of  this  section.    Again  we  quote  from  Markov 's  memoir :®® 


^■^  This  follows  from  Fekete's  determinant  indentity  (see  [17],  pp.  306-7). 
88  See  [38],  p.  103,  beginning  with  line  5. 
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Sn,      S 


IJ    "-2' 


0>    "^iJ    "^2' 


»   "2m-2»   "'2771-1' 
»    *2to— 2»    ^2m-l» 


^0»    ^1'    ^2'    •  •  • '    ^27W-2'    "2m-l 

satisfy  all  the   conditions  of  the  preceding   theorem,^^  then  the 
equations 


0, 


«0 

«1 

•  •  «m-l 

1 

«1 

^2 

•••  «m 

X 

«2 

«3 

•  •   «m+l 

a;2 

«m 

«m+l 

•  •  •   «2m-l 

x"" 

*0 

^1 

•  •    -^^-l 

1 

«1 

«2 

'"    Sm 

a; 

«2 

*3 

•  •    ^m+1 

a;2 

Sm 

«m+l 

•••    «2m-l 

x"^ 

bo 

6i       . 

••    K-1 

1 

h 

h 

..    &,n 

X 

&2 

h       . 

••    ^Tn+l 

x^ 

&,„ 

^m+l 

•••    hm-1 

x^ 

=  0, 


of  degree  m  in  the  unknown  x  do  not  have  multiple  or  imaginary 
or  negative  roots. 

And  the  roots  of  the  second  equation  are  larger  than  the  cor- 
responding roots  of  the  first  equation  and  smaller  than  the  cor- 
responding roots  of  the  last  equation. 

Let  us  find  out  the  connection  of  this  theorem  with  the  domain  of  sta- 
bility in  the  space  of  the  Markov  parameters.  Setting  f(z)='h{z^)+zg{z^) 
and 

h  (-  v)  =  Cov"'  +  Cjt;"*-!  +  •  ••  +  c„      (coT^O), 


we  obtain  from  the  expansion  (105) 
the  identity 


h{-v) 


—  —  s. 


89  He  refers  to  the  preceding  theorem,  Markov's  theorem  on  determinants  (pp.  241). 
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-9'(-^)-(-^-i  +  ^  +  $+---)M"»+Cit;-i  +  ...  +  cJ. 
Equating  to  zero  the  coefficients  of  the  powers  v~'^,  v~^,  . . . ,  v~^j  we  find : 

«oCm      +  5  A-i  +  •  •  •  +  S^Cq       ==:  0, 


+  SzCm-l   +'"  +  Sm+1  %  ^  0, 


^m-l^m  +  ^nfim-1  +  l~  *27n-1^0  ~  ^  '■ 


(138) 


to  these  relations  we  add  the  equation 


written  as 


C;„  +  «^c^-i  +  ---  +  ^%=0. 


Eliminating  from  (138)  and  (139')  the  coefficients  Co,  Ci, 
sent  the  equation  (139)  in  the  form 


5o      Si 


S<%       Sv 


.  5, 


m-1 


.  5, 


...  5, 


m+1 


^m     ^m+1 


'27n-l 


=  0. 


(139) 

(139') 
Cm,  we  repre- 


(139") 


Thus,  the  algebraic  equation  in  the  Chebyshev-Markov  theorem  coincides 
with  (139)  and  the  inequalities  imposed  on  Sq,  Si,  . . . ,  S2m-i  coincide  with 
the  inequalities  (115)  that  determine  the  domain  of  stability  in  the  space 
of  the  Markov  parameters. 

The  Chebyshev-Markov  theorem  shows  how  the  roots  Ui  =  —  Vi, 
U2  =  —  V2,  . . . ,  Um  =  —  Vm  oi  h{io)  chaugc  whcu  the  corresponding  Markov 
parameters  ^o,  Si,  . . . ,  S2m-i  vary  in  the  domain  of  stability. 

The  first  part  of  the  theorem  states  something  we  already  know :  When 
the  inequalities  (115)  are  satisfied,  then  all  the  roots  Ui,  U2,  • .  • ,  Um  oi  h{u) 
are  simple,  real,  and  negative.®^    We  denote  them  as  follows : 

Ui{P),U2(P),...,Um{P), 

where  P  is  the  corresponding  point  of  G. 

The  second  (fundamental)  part  of  the  Chebyshev-Markov  theorem  can 
be  stated  as  follows: 


90  See  Theorem  13,  on  p.  228. 
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Theorem  22  (Chebyshev-Markov)  :    //  P  and  Q  are  two  points  of  G  and 
P  ^precedes'  Q, 

P^Q,  (140) 

Ui(P)    <  Ui(Q),  U2(P)    <  U2(Q),  .  .  .  ,  Um(P)    <  Um{Q)  .  (141) 

Proof.    The  coefficients  of.  h{u)  can  be  expressed  rationally  in  terms  of 
the  parameters  So,  Si,  . . . ,  S2m-i.^^    Then 

h{Ui)=0        (i  =  l,  2,  ...,  m) 
implies  that  :^^ 

W|)  +  ^'(^.)^=0        (i=l,  2,  ...,m;     Z  =  0,  1,  ...,2m-l).       (142) 

On  the  other  hand,  when  we  differentiate  the  expansion 

h{u)~~     -^~^     U  ^2    -t-    ^3 

term  by  term  with  respect  to  s,  we  find : 

I.  /  V  dg{u)  .  ^  dh{u) 

Multiplying  both  sides  of  this  equation  by    _  '    and  denoting  the  coeffi- 
cient of  u^  in  this  polynomial  by  Cu,  we  obtain : 

dh{u) 


h{u)    dgju)  _  ^^^^    dsi     ^  (—  lyCjt 
u  —  Ui     dsi  u  —  Ui  u 


+  •...  (144) 


Comparing  the  coefficients  of  1/u  (the  residues)  on  the  two  sides  of  (144), 
we  find : 

(-l)'-i?(«..)^"^=C.,,  (145) 

which  gives  in  conjunction  with  (142)  : 

dui  (—  lyCii 


dsi        g{ui)h'{ui) 


^1  In  other  words,  the  roots  Ui,  Us,  . . . ,  Um  increase  with  increasing  So,  Sz,  . . .,  S2m-2  and 
with  decreasing  Si,  S3,  . . . ,  S2m-i, 

^2  For  example,  by  the  equations  (138)  if,  for  simplicity,  we  set  Co=  1  in  these  equations 
df^i^i)        \dh(u) 


-Here^  =  [?^], 
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we  obtain  the  formula  of  Chehyshev-Markov : 

S=wSff         (-1.2.  •••>«;     l  =  0,l,...,2r.-l).     (147) 

But  in  the  domain  of  stability  the  values  Ri  {i=l,  2,  . . . ,  m)  are  positive 
(see  (90')  on  p.  226).    The  same  can  be  said  of  the  coefficients  Cu.    For 

P^=4  {n  +  v^f . .  •  (^  +  v,_^f  {u  +  V,)  {u  +  t;,+i)2  •  •  •  (^  +  vj\     (148) 

where 

«^»=-«*i>0     (*=1,  2,  ...,  m). 

From  (148)  it  is  clear  that  all  the  coefficients  Qu  in  the  expansion  of  - — — 

in  powers  of  u  are  positive.    Thus,  we  obtain  from  the  Chebyshev-Markov 
formula : 

(-iy|^>0.  (149) 

In  the  proof  of  Markov's  theorem  we  have  shown  that  any  two  points 
P  -<Q  of  G  can  be  joined  by  an  arcsz=  ( —  iyq)i{t)  (?  =  0, 1, . . . ,  2m  —  1), 
where  q)i{t)  is  a  monotonic  increasing  differentiable  function  oi  t  {t  varies 
within  the  limits  a  and  ^  (a  <  /5)  and  t=^a  corresponds  to  P,  t^ ^  io  Q). 
Then  along  this  arc  we  have,  by  (149)  :^* 

1=0  '' 

Hence  by  integrating  we  obtain: 

^i(t=a)  ='^i  (P)<'^Ht=.p)  ='^i  «?)  (*=  1'  2,   . . .,  m). 

This  completes  the  proof  of  the  Chebyshev-Markov  theorem. 

§  18.    The  Generalized  Routh-Hurwitz  Problem 

1.    In  this  section  we  shall  give  a  rule  to  determine  the  number  of  roots  in  the 
right  half -plane  of  a  polynomial  f{z)  with  complex  coefficients. 


dsi  dtpi 

»*  Since  ( —  1)^"^  =   -^  =0  (a^t^^)  and  for  at  least  one  I  there  exist  values 

of  i  for  which  (—  1)?  -^  >  0. 
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Suppose  that 

/  (*2!)==  V"  +  ^i^"""'  +  "'  +  K+i  K^"  +  ai^""-^  +  '"  +  aj,        (151) 

where  ao,  ai,  . . . ,  On,  bo,  hi,  . . . ,  hn  are  real  numbers.    If  the  degree  oi  f{z) 
is  n,  then  &o  +  w^o  =7^  0.     Without  loss  of  generality  we  may  assume  that 
dQj^O  (otherwise  we  could  replace  f{z)  hy  if  {z)). 
We  shall  assume  that  the  real  polynomials 

ao2"  +  ai2"-'  +•••  +  ««    and     b^z""  +  b^z^'^  +  "'  +  b,  (152) 

are  co-prime,  i.e.,  that  their  resultant  does  not  vanish  :^^ 


F.«  + 


ao 

«!    .. 

•   «n 

0 

...  0 

bo 

b, .. 

.    &n 

0 

...  0 

0 

«o  •• 

•   «n-l 

a« 

...  0 

0 

bo  .. 

•fen-l 

&n 

...  0 

(153) 


Hence  it  follows,  in  particular,  that  the  polynomials  (152)  have  no  roots  in 
common  and  that,  therefore,  f{z)  has  no  roots  on  the  imaginary  axis. 

We  denote  by  k  the  number  of  roots  of  f(z)  with  positive  real  parts. 
By  considering  the  domain  in  the  right  half -plane  bounded  by  the  imaginary 
axis  and  the  semi-circle  of  radius  B  {R-^  00)  and  by  repeating  verbatim 
the  arguments  used  on  p.  177  for  the  real  polynomial  f{z),  we  obtain  the 
formula  for  the  increment  of  arg/(2!)  along  the  imaginary  axis 


^tZ  ^rg  / (=2)  =  (^  —  2^)  ^• 
Hence  we  obtain,  by  ( 151 ) ,  in  view  of  Oo  7^  0 : 


/+ 


n  —  2k. 


Using  Theorem  10  of  §  11  (p.  215),  we  now  obtain: 
Jc=V{l,  P2,  V„  ...,  F2J, 


(154) 


(155) 


(156) 


where 


^^  Tan  is  a  determinant  of  order  2n. 
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Iq      «! 


h  '•'  K- 


2p-l 


0     ttQ  . . .  a^ 

0       bn    .  .  .   ttc 


5-2 


{p  =  l,  2,  ...,  n;  aj,=  hk=^0  for  k>n).      (157) 


We  have  thus  arrived  at  the  following  theorem. 
Theorem  23:    If  a  complex  polynomial  f{z)  is  given  for  which 
/(^2)=6o25"  +  &l2^"~'  +  •••  +  &„  +  *K2"  +  «lZ"-l +  •••  +  «„)  Kt^O) 

and  if  the  polynomials  OoZ*^  +  . . .  +  o^  am,d  hoZ^  +...  +  &«  dre  co-prime 
( ^2nv^  0),  then  the  number  of  roots  of  f{z)  in  the  right  half-plane  is  deter- 
mined by  the  formulas  (156)  and  (157). 

Moreover,  if  some  of  the  determinants  (157)  vanish,  then  for  each  group 
of  successive  zeros 

(I^2A7^0)    F2A+2=-- -=^2^+2^  =  0    (^2^+2^+2  7^0)  (158) 

in  the  calculation  of  V(l,  Pg*  ^4>  •  •  •»  ^2n)  ^^  ^ws#  set: 

7(9-1) 

a;««  17 

2A 


signI^2A+2/  =  (— 1)     ^     sign 

or,  what  is  the  same, 

^(^2A>     '^2A+2»    •••>    ^2A+2p»    ^2A+2p+2) 

^^    for  odd  p, 

~ 

- — ^ /or  even  n  and 

0  =  1.  2,  ...,p) 


(159) 


for  even  p  and  £  =  (—1)2  sign 


(160) 


We  leave  it  to  the  reader  to  verify  that  in  the  special  case  where  f{z) 
is  a  real  polynomial  we  can  obtain  the  Routh-Hurwitz  theorem  (see  §  6) 
from  Theorem  23.»« 

In  conclusion,  we  mention  that  in  this  chapter  we  have  dealt  with  the 
application  of  quadratic  forms  (in  particular,  Hankel  forms)  to  one  problem 
of  the  disposition  of  the  roots  of  a  polynomial  in  the  complex  plane.  Quad- 
ratic and  hermitian  forms  also  have  interesting  applications  to  other  prob- 
lems of  the  disposition  of  roots.  We  refer  the  reader  who  is  interested  in 
these  questions  to  the  survey,  already  quoted,  of  M.  G.  Krem  and  M.  A. 
Naimark  'The  method  of  symmetric  and  hermitian  forms  in  the  theory 
of  separation  of  roots  of  algebraic  equations,'  (Kharkov,  1936). 


^6  Suitable  algorithms  for  the  solution  of  the  generalized  Eouth-Hurwitz  problem  can 
be  found  in  the  monograph  [41]  and  in  the  paper  [39].    See  also  [7]  and  [37] . 


BIBLIOGRAPHY 


BIBUOGRAPHY 

Items  in  the  Russian  language  are  indicated  by  * 

Part  A.    Textbooks,  Monographs,  and  Surveys 

[1]     AcHiESER  (Akhieser),  N.  J.,  Theory  of  Approximation.    New  York:  Ungar,  1956. 

[Translated  from  the  Eussian.] 
[2]     AiTKEN,  A.  C,  Determinants  and  matrices.    9th  ed.,  Edinburgh:  Oliver  and  Boyd, 

1956. 
[3]     Bell^man,  R.,  Stability  Theory  of  Differential  Equations.    New  York:  McGraw- 
Hill,  1953. 
*[4]     Bernstein,  S.  N.,  Theory  of  Probability.    4th  ed.,  Moscow:  Gostekhizdat,  1946. 
[5]     BODEWIG,  E.,  Matrix  Calculus.    2nd  ed.,  Amsterdam:  North  Holland,  1959. 
[6]     Cahen,  G.,  Elements  du  calcul  matriciel.    Paris:  Dunod,  1955. 
*[7]     ChebotarKv,  N.  G.,  and  Meiman,  N.  N.,  The  problem  of  Eouth-Eurwitz  for  poly- 
nomials and  integral  functions.     Trudy  Mat.  Inst.  Steklov.,  vol.  26  (1949). 
*[8]     Chebyshev,  p.  L.,  Complete  collected  worTcs.    vol.  III.  Moscow:  Izd.  AN  SSSR, 

1948. 
*[9]     Chetaev,  N.  G.,  Stability  of  motion.    Moscow:  Gostekhizdat,  1946. 
[10]     CoLJuATZ,  1j.,  Eigenwertaufgaben  mit  technischen  Anwendungen.    Leipzig:  Akad. 
Velags.,  1949. 

[11]     Eigenwertprobleme    und    ihre    numerische    Behandlung.      New    York: 

Chelsea,  1948. 
[12]     Courant,  R.  and  Hilbert,  D.,  Methods  of  Mathematical  Physics,  vol.  I.    Trans, 
and  revised  from  the  German  original.    New  York:  Interscience,  1953. 
*[13]     Erugin,  N.  E.,  The  method  of  Lappo-Danilevskil  in  the  theory  of  linear  differen- 
tial equations,    Leningrad:  Leningrad  University,  1956. 
*[14]     Faddeev,  D.  K.  and  Sominskit,  I.  S.,  Problems  in  higher  algebra.  2nd  ed.,  Moscow, 
1949;  5th  ed.  Moscoav:  Gostekhizdat,  1954. 
[15]     Faddeeva,  V.  N.,  Computational  methods  of  linear  algebra.    New  York:  Dover 

Publications,  1959.     [Translated  from  the  Russian.] 
[16]     Frazer,  R.  a.,  Duncan,  W.  J.,  and  Collar,  A.,  Elementary  Matrices  and  Some 
Applications  to  Dynamics  and  Differential  Equations.     Cambridge:  Cambridge 
University  Press,  1938. 
>*■   *[17]     Gantmacher  (Gantmakher),  F.  R.  and  KrEin,  M.  G.,  Oscillation  matrices  and 
Tcernels  and  small  vibrations  of  dynamical  systems.     2nd  ed.,  Moscow:  Gostekh- 
izdat, 1950.  [A  German  translation  is  in  preparation.] 
[18]     Q'ROB^Y.'BifW.,  Matrizenrechnung.    Munich:  Oldenburg,  1956. 
[19]     Hahn,  W.,  Theorie  und  Anwendung  der  direlcten  Methode  von  Lyapunov  (Ergeb- 
nisse  der  Mathematik,  Neue  Folge,  Heft  22).    Berlin:  Springer,  1959.     [Contains 
an  extensive  bibliography.] 

253 


254  Bibliography 

[20]     Ince,  E.  L.,  Ordinary  Differential  Equations.    New  York:  Dover,  1948. 
[21]     Jung,  H.,  Matrizen  und  Determinanten.    Eine  Einfiihrung.    Leipzig,  1953. 
[22]     Klein,  F.,  Vorlesungen  uber  Mhere  Geometrie.    3rd  ed.,  New  York :  Chelsea,  1949. 
[23]     KowALEWSKi,  G.,  EinfUhrung  in  die  Determinantentheorie.    3rd  ed..  New  York: 
Chelsea,  1949. 
*[24]     Krein,  M.  G.,  Fundamental  propositions  in  the  theory  of  X-zone  stability  of  a 
canonical  system   of   linear   differential  equations   with  periodic   coefficients. 
Moscow:  Moscow  Academy,  1955. 
*[25]     Krein,  M.  G.  and  Naimark,  M.  A,,  The  method  of  symmetric  and  hermitian  forms 
in  the  theory  of  separation  of  roots  of  algebraic  equations.     Kharkov:  GNTI, 
1936. 
*[26]     Krein,  M.  G.  and  Eutman,  M.  A.,  Linear  operators  leaving  a  cone  in  a  Banach 

space  invariant.    Uspehi  Mat.  Nauk,  vol.  3  no.  1,  (1948). 
*[27]     Kudryavchev,  L.  D.,  On  some  mathematical  problems  in  the  theory  of  electrical 

networTcs.    Uspehi  Mat.  Nauk,  vol.  3  no.  4  (1948). 
*[28]     Lappo-Danilevskii,  I.  A.,  Theory  of  functions  of  matrices  and  systems  of  linear 
differential  equations.    Moscow,  1934. 

[29]     Memoires  sur  la  theorie  des  systemes  des  equations  differentielles  line- 

aires.     3  vols.,  Trudy  Mat.  Inst.  Steklov.  vols.  6-8   (1934-1936).     New  York: 
Chelsea,  1953. 
[30]     Lefschetz,  S.,  Differential  Equations:   Geometric  Theory.     New  York:   Inter- 
science,  1957. 
[31]     Lichnerowicz,  A.,  Algebre  et  analyse  lineaires.    2nd  ed.,  Paris:  Masson,  1956. 
[32]     Lyapunov  (Liapounoff),  A.  M.,  Le  Probleme  general  de  la  stabilite  du  mouve- 
ment   (Annals  of  Mathematics  Studies,  No.  17).     Princeton:   Princeton  Univ. 
Press,  1949. 
[33]     MacDuffee,  C.  C,  The  Theory  of  Matrices.    New  York:  Chelsea,  1946. 

[34]     Vectors  and  matrices.    La  Salle:  Open  Court,  1943. 

*[35]     Malkin,  I.  G.,  The  method  of  Lyapunov  and  Poincare  in  the  theory  of  non-linear 
oscillations.    Moscow:  Gostekhizdat,  1949. 

[36]     Theory  of  stability  of  motion.    Moscow:  Gostekhizdat,  1952.    [A  German 

translation  is  in  preparation.] 
[37]     Marden,  M.,  The  geometry  of  the  zeros  of  a  polynomial  in  a  complex  variable 
(Mathematical  Surveys,  No.  3).    New  York:  Amer.  Math.  Society,  1949. 
*[38]     Markov,  A.  A.,  Collected  worTcs.    Moscow,  1948. 

*[39]     Meiman,  N.  N.,  Some  problems  in  the  disposition  of  roots  of  polynomials.    Uspehi 
Mat.  Nauk,  vol.  4  (1949). 
[40]     MiRSKY,  L.,  An  Introduction  to  Linear  Algebra.     Oxford:   Oxford  University 
Press,  1955. 
*[41]     Naimark,  Y.  I.,  Stability  of  linearized  systems.    Leningrad:  Leningrad  Aero- 
nautical Engineering  Academy,  1949. 
[42]     Parodi,  M.,  Sur  quelques  proprietes  des  valeurs  caracteristiques  des    matrices 
carries  (Memorial  des  Sciences  Mathematiques,  vol.  118),  Paris:  Gauthiers-Villars, 
1952. 

[43]     Perlis,  S.,  Theory  of  Matrices.    Cambridge.  (Mass.)  :  Addison-Wesley,  1952. 
[44]     PiCKERT,  G.,  Normalformen  von  Matrizen   (Enz.  Math.  Wiss.,  Band  I,  Teil  B. 
Heft  3,  Teil  I).  Leipzig:  Teubner,  1953. 
*[45]     PoTAPOV,  V.  p..  The  multiplicative  structure  of  J-inextensible  matrix  functions. 
Trudy  Moscow  Mat.  Soc,  vol.  4  (1955). 


Bibliography  255 

EoMANOVSKii,  V.  I.,  Discrete  Markov  chains.    Moscow:  Gostekhizdat,  1948. 
EouTH,  E.  J.,  A  treatise  on  the  stability  of  a  given  state  of  motion.     London: 
Macmillan,  1877. 

The  advanced  part  of  a  Treatise  on  the  Dynamics  of  a  Rigid  Body. 

6th  ed.,  London:  Macmillan,  1905;  repr.,  New  York:  Dover,  1959. 
ScHLESiNGER,  L.,  Vorlesungen  uber  lineare  Differentialgleichungen.    Berlin,  1908. 

Einf'iihrung  in  die  Theorie  der  gew'dhnlichen  Differentialgleichungen  auf 

funhtionentheoretischer  Grundlage.    Berlin,  1922. 

ScHMEiDLER,  W.,  Vortrage  uber  Determinanten  und  Matrizen  mit  Anwendungen 

in  Physilc  und  Technik.    Berlin:  Akademie-Verlag,  1949. 

SCHREIER,  O.  and  Sperner,  E.,  Vorlesungen  Uber  Matrizen.     Leipzig:  Teubner, 

1932.     [A  slightly  revised  version  of  this  book  appears  as  Chapter  V  of  [53].] 

— Introduction  to  Modern  Algebra  and  Matrix  Theory.    New  York :  Chelsea, 

1958. 

Schwerdtfeger,  H.,  Introduction  to  Linear  Algebra  and  the  Theory  of  Matrices. 
Groningen:  Noordhoff,  1950. 

Shohat,  J.  A.  and  Tamarkin,  J.  D.,  The  problem  of  moments  (Mathematical 
Surveys,  No.  1).    New  York:  Amer.  Math.  Society,  1943. 

Smirnow,  W.  I.  (Smirnov,  V.  I.),  Lehrgang  der  hoheren  MathematiTc,  Vol.  III. 
Berlin,  1956.  [This  is  a  translation  of  the  13th  Russian  edition.] 
Specht,  W.,  Algebraische  Gleichungen  mit  reellen  oder  Icomplexen  Koeffizienten 
(Enz.  Math.  Wiss.,  Band  I,  Teil  B,  Heft  3,  Teil  II).  Stuttgart:  Teubner,  1958. 
Stieltjes,  T.  J.,  Oeuvres  Completes.  2  vols.,  Groningen:  Noordhoff. 
Stoll,  R.  R.,  Linear  Algebra  and  Matrix  Theory.  New  York :  McGraw-Hill,  1952. 
Thrall,  R.  M.  and  Tornheim,  L.,  Vector  spaces  and  matrices.  New  York: 
Wiley,  1957. 

TuRNBULL,  H.  W.,  The  Theory  of  Determinants,  Matrices  and  Invariants.    Lon- 
don: Blackie,  1950. 

TuRNBULL,  H.  W,  and  Aitken,  A.  C,  An  Introduction  to  the  Theory  of  Canonical 
Matrices.    London :  Blackie,  1932. 

VoLTERRA,  V.  et  HosTiNSKY,  B.,  Operations  infinitisimales  lineaires.     Paris: 
Gauthiers-Villars,  1938. 

Wedderburn,  J.  H.  M.,  Lectures  on  matrices.    New  York :  Amer.  Math.  Society, 
1934. 

Weyl,  H.,  Mathematische  Analyse  des  Raumproblems.    Berlin,  1923.     [A  reprint 
is  in  preparation:  Chelsea,  I960.] 

Wintner,  a.,  SpeTctraltheorie  der  unendlichen  Matrizen.    Leipzig,  1929. 
Zurmuhl,  R.,  Matrizen.    Berlin,  1950. 

Part  B.    Papers 

Afriat,  S.,  Composite  matrices,  Quart.  J.  Math.  vol.  5,  pp.  81-89  (1954). 
AiZERMAN  (Aisermann),  M.  A.,  On  the  computation  of  non-linear  functions  of 
several  variables  in  the  investigation  of  the  stability  of  an  automatic  regulating 
system,  Avtomat.  i  Telemeh.  vol.  8  (1947). 
[103]  Aisermann,  M.  A.  and  F.  R.  Gantmacher,  Determination  of  stability  by  linear 
approximation  of  a  periodic  solution  of  a  system  of  differential  equations  with 
discontinuous  right-hand  sides.  Quart.  J.  Mech.  Appl.  Math.  vol.  11,  pp.  385-98 
(1958). 


256  Bibliography 

104]     AiTKEN,  A.  C,  studies  in  practical  mathematics.     The  evaluation,  with  applica- 
tions, of  a  certain  triple  product  matrix.     Proc.  Eoy.  Soc.  Edinburgh  vol.  57, 

(1936-37). 
105]     Amir  Moez  Ali,  E.,  Extreme  properties  of  eigenvalues  of  a  hermitian  transforma- 
tion and  singular  values  of  the  sum  and  product  of  linear  transformations,  Duke 

Math.  J.  vol.  23,  pp.  463-76  (1956). 
106]     Artashenkov,  P.  V.,  Determination  of  the   arbitrariness  in  the  choice   of  a 

matrix  reducing  a  system  of  linear  differential  equations  to  a  system  with  con- 
stant coefficients.     Vestnik  Leningrad.  Univ.,  Ser.  Mat.,  Phys.  i  Chim.,  vol.  2, 

pp.  17-29  (1953). 
[107]     Arzhanych,  I.  S.,  Extension  of  Erylov's  method  to  polynomial  matrices,  Dokl. 

Akad.  Nauk  SSSR,  Vol.  81,  pp.  749-52  (1951). 
"108]     AzBELEV,  N.  and  R.  Vinograd,  The  process  of  successive  approximations  for  the 

computation  of  eigenvalues  and  eigenvectors,  Dokl.  Akad.  Nauk.,  vol.  83,  pp.  173- 

74  (1952). 
109]     Baker,  H.  F.,  On  the  integration  of  linear  differential  equations,  Proc.  London 

Math.  Soc,  vol.  35,  pp.  333-78  (1903). 
110]     Barankin,  E.  W.,  Bounds  for  characteristic  roots  of  a  matrix.  Bull.  Amer.  Math. 

Soc,  vol.  51,  pp.  767-70  (1945). 
"Ill]     Bartsch,  H.,  Abschatzungen  fur  die  Kleinste  charaTcteristische  Zahl  einer  positiv- 

definiten  hermitschen  Matrix,  Z.  Angew.  Math.  Mech.,  vol.  34,  pp.  72-74  (1954). 
112]     Bellman,  E.,  Notes  on  matrix  theory,  Amer.  Math.  Monthly,  vol.  60,  pp.  173-75, 

(1953);  vol.  62,  pp.  172-73,  571-72,  647-48  (1955);  vol.  64,  pp.  189-91  (1957). 
113]     Bellman,  E.  and  A.  Hoffman,  On  a  theorem  of  OstrowsTci,  Arch.  Math.,  vol.  5, 

pp.  123-27  (1954). 
114]     Bendat,  J.  and  S.  Silverman,  Monotone  and  convex  operator  functions,  Trans. 

Amer.  Math.  Soc,  vol.  79,  pp.  58-71  (1955). 
115]     Berge,  C,  Sur  une  propriete  des  matrices  douMement  stochastiques,  C.  E.  Acad. 

Sci.  Paris,  vol.  241,  pp.  269-71  (1955). 
116]     BiRKHOFF,  G.,  On  product  integration,  J.  Math.  Phys.,  vol.  16,  pp.  104-32  (1937). 
"117]     BiRKHOFF,   G.  D.,  Equivalent  singular  points  of  ordinary   linear  differential 

equations.  Math.  Ann.,  vol.  74,  pp.  134-39  (1913). 
118]     BoTT,  E.  and  E.  Duffin,  On  the  algebra  of  networks.  Trans.  Amer.  Math.  Soc, 

vol.  74,  pp.  99-109  (1953). 
119]     Brauer,  a..  Limits  for  the  characteristic  roots  of  a  matrix,  Duke  Math.  J., 

vol.  13,  pp.  387-95  (1946)  ;  vol.  14,  pp.  21-26  (1947)  ;  vol.  15,  pp.  871-77  (1948)  ; 

vol.  19,  pp.'73-91,  553-62  (1952);  vol.  22,  pp.  387-95  (1955). 
120]     ^ber  die  Lage  der  charalcteristischen  Wurzeln  einer  Matrix,  J.  Eeine 

Angew.  Math.,  vol.  192,  pp.  113-16  (1953). 
121]     Bounds  for  the  ratios  of  the  coordinates  of  the  characteristic  vectors  of 

a  matrix,  Proc.  Nat.  Acad.  Sci.  U.S.A.,  vol.  41,  pp.  162-64  (1955). 
122]     The  theorems  of  Ledermann  and  OstrowsTci  on  positive  matrices,  Duke 

Math.  J.,  vol.  24,  pp.  265-74  (1957). 
123]     Brenner,  J.,  Bounds  for  determinants,  Proc.  Nat.  Acad.  Sci.  U.S.A.,  vol.  40, 

pp.  452-54  (1954);  Proc  Amer.  Math.  Soc,  vol.  5,  pp.  631-34  (1954);  vol.  8, 

pp.  532-34  (1957);  C.  E.  Acad.  Sci.  Paris,  vol.  238,  pp.  555-56  (1954). 
124]     Bruijn,  N.,  Inequalities  concerning  minors  and  eigenvalues,  Nieuw  Arch.  Wisk., 

vol.  4,  pp.  18-35  (1956). 
125]     Bruijn,  N.  and  G.  Szekeres,  On  some  exponential  and  polar  representatives  of 

matrices,  Nieuw  Arch.  Wisk.,  vol.  3,  pp.  20-32  (1955). 


Bibliography  257 

126]     Bulgakov,  B.  V.,  The  splitting  of  rectangular  matrices,  Dokl.  Akad.  Nauk  SSSR, 

vol.  85,  pp.  21-24  (1952). 
127]     Cayley,  a.,  a  memoir  on  the  theory  of  matrices,  Phil.  Trans.  London,  vol.  148, 

pp.  17-37  (1857) ;  Coll.  Works,  vol.  2,  pp.  475-96. 
^128]     CoLLATZ,  L.,  Einschliessungssats  fiir  die  charakteristischen  Zahlen  von  Matrizen, 

Math.  Z.,  vol.  48,  pp.  221-26  (1942). 
129]     Pher  monotone  systeme  linearen  Ungleichungen,  J.  Reine  Angew.  Math., 

vol.  194,  pp.  193-94  (1955). 
130]     Cremer,  L.,  Die  Verringerung  der  Zahl  der  StahilitdtsTcriterien  bei  Voraussetzung 

positiven  Icoeffizienten  der  charaTcteristischen  Gleichung,  Z.  Angew.  Math.  Mech., 

vol.  33,  pp.  222-27  (1953). 
[131]     Danilevskii,  a.  M.,  On  the  numerical  solution  of  the  secular  equation,  Mat.  Sb., 

vol.  2,  pp.  169-72  (1937). 
132]     Diliberto,  S.,  On  systems  of  ordinary  differential  operations.    In:  Contributions 

to  the  Theory  of  Non-linear  Oscillations,  vol.  I,  edited  by  S.  Lefschetz  (Annals 

of  Mathematics  Studies,  No.  20).     Princeton:   Princeton  Univ.  Press   (1950), 

pp.  1-38. 
133]     Dmitriev,  N.  a.  and  E.  B.  Dynkin,  On  the  characteristic  roots  of  stochastic 

matrices,  Dokl.  Akad.  Nauk  SSSR,  vol.  49,  pp.  159-62   (1945). 
'133a]  Characteristic  roots  of  Stochastic  Matrices,  Izv.  Akad.  Nauk,  Ser.  Fiz- 

Mat.,  vol.  10,  pp.  167-94  (1946). 
134]     DoBSCH,   O.,   Matrix funMionen   heschr'dnlcter   SchwanJcung,   Math.   Z.,   vol.   43, 

pp.  353-88  (1937). 

[135]     DONSKAYA,  L.  I.,  Construction  of  the  solution  of  a  linear  system  in  the  neighbor- 
hood of  a  regular  singularity  in  special  cases,  Vestnik  Leningrad.  Univ.,  vol.  6 

(1952). 

[136]     On  the  structure  of  the  solution  of  a  system  of  linear  differential  equa- 
tions in  the  neighbourhood  of  a  regular  singularity,  Vestnik  Leningrad.     Univ., 

vol.  8,  pp.  55-64  (1954). 
[137]     DuBNOV,  Y.  S.,  On  simultaneous  invariants  of  a  system  of  affinors.  Trans.  Math. 

Congress  in  Moscow  1927,  pp.  236-37. 
138]     On  doubly  symmetric  orthogonal  matrices.  Bull.  Ass.  Inst.  Univ.  Moscow, 

pp.  33-35  (1927). 

[139]     On  Dirac's  matrices,  Uc.  zap.  Univ.  Moscow,  vol.  2,  pp.  2,  43-48  (1934). 

[140]    'DuBNOV,  Y.  S.  and  V.  K.  Ivanov,  On  the  reduction  of  the  degree  of  affinor 

polynomials,  Dokl.  Akad.  Nauk  SSSR,  vol.  41,  pp.  99-102  (1943). 
[141]     Duncan,  W.,  Reciprocation  of  triply -partitioned  matrices,  J.  Roy.  Aero.  Soc, 

vol.  60,  pp.  131-32  (1956). 
142]     Egervary,  E.,  On  a  lemma  of  Stieltjes  on  matrices.  Acta.  Sci.  Math.,  vol.  15, 

pp.  99-103  (1954). 

143]     On  hypermatrices  whose  blocks  are  commutable  in  pairs  and  their  appli- 
cation in  lattice-dynamics.  Acta  Sci.  Math.,  vol.  15,  pp.  211-22  (1954). 
144]     Epstein,  M.  and  H.  Flanders,  On  the  reduction  of  a  matrix  to  diagonal  form, 

Amer.  Math.  Monthly,  vol.  62,  pp.  168-71  (1955). 
[145]     Ershov,  a.  p..  On  a  method  of  inverting  matrices,  Dokl.  Akad.  Nauk  SSSR, 

vol.  100,  pp.  209-11  (1955). 
[146]     Erugin,  N.  p.,  Sur  la  substitution  exposante  pour  quelques  systdmes  irreguliers. 

Mat.  Sb.,  vol.  42,  pp.  745-53  (1935). 
[147]     Exponential  substitutions  of  an  irregular  system  of  linear  differential 

equations,  Dokl.  Akad.  Nauk  SSSR,  vol.  17,  pp.  235-36   (1935). 


2  58  Bibliography 

*[148]     On  Biemann's  problem  for  a  Gaussian  system,  Uc.  Zap.  Ped.  Inst.,  vol.  28, 

pp.  293-304  (1939). 
*[149]     Faddeev,  D.  K.,  On  the  transformation  of  the  secular  equation  of  a  matrix, 

Trans.  Inst.  Eng.  Constr.,  vol.  4,  pp.  78-86  (1937). 
[150]     Taedo,  S.,  Un  nuove  problema  di  stahilita  per  le  equationi  algebriche  a  coeffi- 

cienti  reali,  Ann.  Scuola  Norm.  Sup.  Pisa,  vol.  7,  pp.  53-63  (1953). 
*[151]     Page,  M.  K.,  Generalization  of  Hadamard's  determinant  inequality,  Dokl.  Akad. 

Nauk  SSSE,  vol.  54,  pp.  765-68  (1946). 

*[152]     On  symmetrizahle  matrices,  Uspehi  Mat.  Nauk,  vol.  6,  no.  3,  pp.  153-56 

(1951). 
[153]     Pan,  K.,  On  a  theorem  of  Weyl  concerning  eigenvalues  of  linear  transformations, 

Proc.  Nat.  Acad.  Sci.  U.S.A.,  vol.  35,  pp.  652-55  (1949) ;  vol.  36,  pp.  31-35  (1950). 
[154]     Maximum  properties  and  inequalities  for  the  eigenvalues  of  completely 

continuous  operators,  Proc.  Nat.  Acad.  Sci.  U.S.A.,  vol.  37,  pp.  760-66  (1951). 
[155]     A  comparison  theorem  for  eigenvalues  of  normal  matrices.  Pacific  J. 

Math.,  vol.  5,  pp.  911-13  (1955). 
[156]     Some  inequalities  concerning  positive-definite  Hermitian  matrices,  Proc. 

Cambridge  Philos.  Soc,  vol.  51,  pp.  414-21  (1955). 
[157]     Topological  proofs  for  certain  theorems  on  matrices  with  non-negative 

elements,  Monatsh.  Math.,  vol.  62,  pp.  219-37  (1958). 
[158]     Fan,  K.  and  A.  Hoffman,  Some  metric  inequalities  in  the  spa^e  of  matrices,  Proc. 

Amer.  Math.  Soc,  vol.  6,  pp.  111-16  (1958). 
[159]     Fan,  K.  and  G.  Pall,  Imbedding  conditions  for  Hermitian  and  normal  matrices, 

Canad.  J.  Math.,  vol.  9,  pp.  298-304  (1957). 
[160]     Fan,  K.  and  J.  Todd,  A  determinantal  inequality,  J.  London  Math.  Soc,  vol.  30, 

pp.  58-64  (1955). 
[161]     Frobenius,   G.,    Pber  lineare   substitutionen  und   bilinear e  Formen,  J.   Eeine 

Angew.  Math.,  vol.  84,  pp.  1-63  (1877). 
[162]     t^ber   das   Trdgheitsgesetz   der  quadratischen  Formen,   S.-B.   Deutsch. 

Akad.  Wiss.  Berlin.  Math.-Nat.  Kl.,  1894,  pp.  241-56,  407-31. 
[163]     lyber  die  cogredienten  transformationen  der  bilinearer  Formen,  S.-B. 

Deutsch.  Akad.  Wiss.  Berlin.  Math.-Nat.  Kl.,  1896,  pp.  7-16. 
[164]     t^ber  die  vertauschbaren  Matrisen,  S.-B.  Deutsch.  Akad.  Wiss.  Berlin. 

Math.-Nat.  Kl.,  1896,  pp.  601-614. 
[165]     l^ber    Matrizen  aus  positiven  Elementen,   S.-B.   Deutsch.   Akad.  Wiss. 

Berlin.  Math-Nat.  Kl.  1908,  pp.  471-76 ;  1909,  pp.  514-18. 
[166]     tyber  Matrizen  aus  nicht  negativen  Elementen,  S.-B.  Deutsch.  Akad. 

Wiss.  Berlin  Math.-Nat.  Kl.,  1912,  pp.  456-77. 
*[167]     Gantmacher,  F.  E.,  Geometric  theory  of  elementary  divisors  after  Krull,  Trudy 

Odessa  Gos.  Univ.  Mat.,  vol.  1,  pp.  89-108  (1935). 
*[168]     On  the  algebraic  analysis  of  Erylov's  method  of  transforming  the  secular 

equation.  Trans.  Second  Math.  Congress,  vol.  II,  pp.  45-48  (1934). 
[169]     On  the  classification  of  real  simple  Lie  groups.  Mat.  Sb.,  vol.  5,  pp.  217-50 

(1939). 
*[170]     Gantmacher,  F.  E.  and  M.  G.  Krein,  On  the  structure  of  an  orthogonal  matrix, 

Trans.  Ukrain,  Acad.  Sci.  Phys.-Mat.  Kiev  (Trudy  fiz.-mat.  otdela  VUAN,  Kiev), 

1929,  pp.  1-8. 
*[171]     Normal  operators  in  a  hermitian  space,  Bull.  Phys-Mat.  Soc  Univ.  Kasan 

(Izvestiya  fiz.-mat.  ob-va  pri  Kazanskom  universitete),  IV,  vol.  1,  ser.  3,  pp.  71-84 

(1929-30). 


Bibliography  259 

*[172]     On  a  special  class  of  determinants  connected  with  Kellogg 's  integral 

kernels,  Mat.  Sb.,  vol.  42,  pp.  501-8  (1935). 
[173]     Sur  les  matrices  oscillatoires  et  comyletement  non-negatives,  Compositio 

Math.,  vol.  4,  pp.  445-76  (1937). 
[174]     Gantschi,  W.,  Bounds  of  matrices  with  regard  to  an  hermitian  metric,  Compositio 

Math.,  vol.  12,  pp.  1-16  (1954). 
*[175]     Gelfand,  I.  M.  and  V.  B.  Lidskii,  On  the  structure  of  the  domains  of  stability 

of  linear  canonical  systems  of  differential  equations  with  periodic  coefficients, 

Uspehi.  Mat.  Nauk,  vol.  10,  no.  1,  pp.  3-40  (1955). 
[176]     Gershqorin,  S.  A.,  t^ber  die  Abgrenzung  der  Eigenwerte  einer  Matrix,  Izv.  Akad, 

Nauk  SSSR,  Ser.  Fiz.-Mat.,  vol.  6,  pp.  749-54  (1931). 
[177]     GoDDARD,  L.,  An  extension  of  a  matrix  theorem  of  A.  Brauer,  Proc.  Int.  Cong. 

Math.  Amsterdam,  1954,  vol.  2,  pp.  22-23. 
[178]     GoHEEN,  H.  E.,  On  a  lemma  of  Stieltjes  on  matrices,  Amer.  Math.  Monthly, 

vol.  56,  pp.  328-29  (1949). 
*[179]     GOLUBCHIKOV,  A.  F.,  On  the  structure  of  the  automorphisms  of  the  complex  simple 

Lie  groups,  Dokl.  Akad.  Nauk  SSSR,  vol.  27,  pp.  7-9  (1951). 
*[180]     Grave,  D.  A.,  Small  oscillations  and  some  propositions  in  algebra,  Izv.  Akad. 

Nauk  SSSR,  Ser.  Fiz.-Mat.,  vol.  2,  pp.  563-70  (1929). 
*[181]     Grossman,  D.  P.,  On  the  problem  of  a  numerical  solution  of  systems  of  simul- 
taneous linear  algebraic  equations,  Uspehi  Mat.  Nauk,  vol.  5,  no.  3,  pp.  87-103 

(1950). 
[182]     Hahn,  W.,  Eine  BemerTcung  zur  sweiten  Methode  von  Lyapunov,  Math.  Nachr., 

vol.  14,  pp.  349-54  (1956). 
[183]     l^ber  die  Anwendung  der  Methode  von  Lyapunov  auf  Differenzengleich- 

ungen,  Math.  Ann.,  vol.  136,  pp.  430-41  (1958). 
[184]     Haynsworth,  E.,  Bounds  for  determinants  with  dominant  main  diagonal,  Duke 

Math.  J.,  vol.  20,  pp.  199-209  (1953). 
[185]     Note  on  bounds  for  certain  determinants,  Duke  Math.  J.,  vol.  24,  pp.  313- 

19  (1957). 
[186]     Hellmann,  O.,  Die  Anwendung  der  Matrizanten  bei  Eigenwertaufgaben,  Z. 

Angew.  Math.  Mech.,  vol.  35,  pp.  300-15  (1955). 
[187]     Hermite,  C,  Sur  le  nombre  des  racines  d'une  equation  algebrique  comprise  entre 

des  limites  donnees,  J.  Reine  Angew.  Math.,  vol.  52,  pp.  39-51  (1856). 
[188]     Hjelmsler,  J.,  Introduction  a  la  thSorie  des  suites  monotones,  Kgl.  Danske  Vid. 

Selbsk.  Forh.  1914,  pp.  1-74. 
[189]     Hoffman,  A.  and  O.  Taussky,  A  characterization  of  normal  matrices,  J.  Res. 

Nat.  Bur.  Standards,  vol.  52,  pp.  17-19  (1954). 
[190]     Hoffman,  A.  and  H.  Wielandt,  The  variation  of  the  spectrum  of  a  normal 

matrix,  Duke  Math.  J.,  vol.  20,  pp.  37-39  (1953). 
[191]     Horn,  A.,  On  the  eigenvalues  of  a  matrix  with  prescribed  singular  values,  Proc. 

Amer.  Math.  Soc,  vol.  5,  pp.  4-7  (1954). 
[192]     Hotelling,  H.,  Some  new  methods  in  matrix  calculation,  Ann.  Math.  Statist., 

vol.  14,  pp.  1-34  (1943). 
[193]     Householder,  A.  S.,  On  matrices  with  non-negative  elements,  Monatsh.  Math., 

vol.  62,  pp.  238-49  (1958). 
[194]     Householder,  A.  S.  and  F.  L.  Bauer,  On  certain  methods  for  expanding  the 

characteristic  polynomial,  Numer.  Math.,  vol.  1,  pp.  29-35  (1959). 
[195]     Hsu,  P.  L.,   On  symmetric,  orthogonal,  and  skew-symmetric  matrices,  Proc. 

Edinburgh  Math.  Soc,  vol.  10,  pp.  37-44  (1953). 


260  Bibliography 

[196]     On  a  Tcind  of  transformation  of  matrices,  Acta  Math.  Sinica,  vol.  5, 

pp.  333-47  (1955). 
[197]     HuA,  L.-K.,  On  the  theory  of  automorphic  functions  of  a  matrix  variable,  Amer. 

J.  Math.,  vol.  66,  pp.  470-88;  531-63  (1944). 
[198]     Geometries  of  matrices,  Trans.  Amer.  Math.  Soc,  vol.  57,  pp.  441-90 

(1945). 
[199]     Orthogonal  classification  of  Hermitian  matrices,  Trans.  Amer.  Math. 

Soc,  vol.  59,  pp.  508-23  (1946). 
*[200]     Geometries  of  symmetric  matrices  over  the  real  field,  Dokl.  Akad.  Nauk 

SSSR,  vol.  53,  pp.  95-98;  195-96  (1946). 
*[201]     Automorphisms  of  the  real  symplectic  group,  Dokl.  Akad.  Nauk  SSSR, 

vol.  53,  pp.  303-306  (1946). 
[202]     Inequalities  involving  determinants.  Acta  Math.  Sinica,  vol.  5,  pp.  463-70 

(1955). 
*[203]     HuA,  L.-K.  and  B.  A.  Rosenfeld,  The  geometry  of  rectangular  matrices  and  their 

application  to  the  real  projective  and  non-euclidean  geometries,  Izv.  Higher  Ed. 

SSSR,  Matematika,  vol.  1,  pp.  233-46  (1957). 
[204]     HuRWiTZ,  A.,  Pher  die  Bedingungen,  unter  welchen  eine  Gleichung  nur  Wurzeln 

mit  negativen  reellen  Teilen  hesitzt.  Math.  Ann.,  vol.  46,  pp.  273-84  (1895). 
[205]     Ingraham,  M.  H.,  On  the  reduction  of  a  matrix  to  its  rational  canonical  form. 

Bull.  Amer.  Math.  Soc,  vol.  39,  pp.  379-82  (1933). 
[206]     lONESCU,  D.,  0  identitate  importantd  si  descompunere  a  unci  forme  hilineare  into 

sumd  de  produse,  Gaz.  Mat.  Ser.  Fiz.  A.  7,  vol.  7,  pp.  303-312  (1955). 
[207]     ISHAK,  M.,  Sur  les  spectres  des  matrices,  Sem.  P.  Dubreil  et  Ch.  Pisot,  Fac.  Sci. 

Paris,  vol.  9,  pp.  1-14  (1955/56). 
*[208]     Kagan,  V.  F.,  On  some  number  systems  arising  from  Lorentz  transformations, 

Izv.  Ass.  Inst.  Moscow  Univ.  1927,  pp.  3-31. 
*[209]     Karpelevich,  F.  I.,  On  the  eigenvalues  of  a  matrix  with  non-negative  elements, 

Izv.  Akad.  Nauk  SSSR  Ser.  Mat.,  vol.  15,  pp.  361-83  (1951). 
[210]     Khan,  N.  A.,  The  characteristic  roots  of  a  product  of  matrices,  Quart.  J.  Math., 

vol.  7,  pp.  138-43  (1956). 
*[211]     Khlodovskii,  I.  N.,  On. the  theory  of  the  general  case  of  Krylov's  transforma- 
tion of  the  secular  equation,  Izv.  Akad.  Nauk,  Ser.  Fiz.-Mat.,  vol.  7,  pp.  1076-1102 

(1933). 
*[212]     Kolmogorov,  A.  N.,  MarTcov  chains  with  countdbly  many  possible  states,  Bull. 

Univ.  Moscow  (A),  vol.  1:3  (1937). 
*[213]     Kotelyanskii,  D.  M.,  On  monotonic  matrix  functions  of  order  n.  Trans.  Univ. 

Odessa,  vol.  3,  pp.  103-114  (1941). 
*[214]     On  the  theory  of  non-negative  and  oscillatory  matrices,  Ukrain.  Mat.  Z., 

vol.  2,  pp.  94-101  (1950). 
*[215]     On  some  properties  of  matrices  with  positive  elements.  Mat.  Sb.,  vol.  31, 

pp.  497-506  (1952). 
•  [216]     On  a  property  of  matrices  of  symmetric  signs,  Uspehi  Mat.  Nauk,  vol.  8, 

no.  4,  pp.  163-67  (1953). 
*[217]     On  some  sufficient  conditions  for  the  spectrum  of  a  matrix  to  be  real 

and  simple,  Mat.  Sb.,  vol.  36,  pp.  163-68  (1955). 
*[218]     On  the  influence  of  Gauss'  transformation  on  the  spectra  of  matrices, 

Uspehi  Mat.  Nauk,  vol.   9,  no.  3,  pp.  117-21  (1954). 
*[219]     On  the  distribution  of  points  on  a  matrix  spectrum,  Ukrain.  Mat.  Z., 

vol.  7,  pp.  131-33  (1955). 


Bibliography  261 

*[220] Estimates  for  determinants  of  matrices  with  dominant  main  diagonal, 

Izv.  Akad.  Nauk  SSSR,  Ser.  Mat.,  vol.  20,  pp.  137-44  (1956). 
*[221]     KovALENKO,  K.  R.  and  M.  G.  Kre!in,  On  some  investigations  of  Lyapunov  con- 
cerning  differential   equations   with   periodic   coefficients,   Dokl.    Akad.    Nauk 

SSSR,  vol.  75,  pp.  495-99  (1950). 
[222]     KowALEWSKi,  G.,  Natiirliche  Normalformen  linear er  Transformationen,  Leipz. 

Ber.,  vol.  69,  pp.  325-35  (1917). 
*[223]     Krasovskii,  N.  N.,  On  the  stability  after  the  first  approximation,  Prikl.  Mat. 

Meh.,  vol.  19,  pp.  516-30  (1955). 
*[224]     Krasnosel 'SKii,  M.  A,  and  M.  G.  Krein,  An  iteration  process  with  minimal 

deviations.  Mat.  Sb.,  vol.  31,  pp.  315-34  (1952). 
[225]     Kraus,  F.,  Vher  Jconvexe  Matrix funTctionen,  Math.  Z.,  vol.  41,  pp.  18-42  (1936). 
*[226]     Kravchuk^  M.  F.,  On  the  general  theory  fif  bilinear  forms,  Izv.  Polyt.  Inst. 

Kiev,  vol.  19,  pp.  17-18  (1924). 
*[227]     On  the  theory  of  permutdble  matrices,  Zap.  Akad.  Nauk  Kiev,  Ser.  Fiz.- 

Mat.,  vol.  1:2,  pp.  28-33  (1924). 
*[228]     On  a  transformation  of  quadratic  forms.  Zap.  Akad.  Nauk  Kiev,  Ser. 

Fiz.-Mat.,  vol.  1:2,  pp.  87-90  (1924). 
*[229]     On  quadratic  forms  and  linear  transformations.  Zap.  Akad.  Nauk  Kiev, 

Ser.  Fiz.-Mat.,  vol.  1:3,  pp.  1-89  (1924). 
*[230]     Permutahle  sets  of  linear  transformations,  Zap.  Agr.  Inst.  Kiev,  vol.  1, 

pp.  25-58  (1926). 
[231]     t^ber  vertauschbare  Matrizen,  Rend.  Circ.  Mat.  Palermo,  vol.  51,  pp.  126- 

30  (1927). 
*[232]     On  the  structure  of  permutdble  groups  of  matrices,  Trans.  Second.  Mat. 

Congress  1934,  vol.  2,  pp.  11-12. 
*[233]     Kravchuk,  M.  F.  and  Y.  S.  Gol'dbaum,  On  groups  of  commuting  matrices, 

Trans.  Av.  Inst.  Kiev,  1929,  pp.  73-98 ;  1936,  pp.  12-23. 
*[234]     On  the  equivalence  of  singular  pencils  of  matrices,  Trans.  Av.  Inst.  Kiev, 

1936,  pp.  5-27. 
*[235]     Krein,  M.  G.,  Addendum  to  the  paper  *  On  the  structure  of  an  orthogonal  matrix,* 

Trans.  Fiz.-Mat.  Class.  Akad.  Nauk  Kiev,  1931,  pp.  103-7. 
*[236]     On  the  spectrum  of  a  Jacobian  form  in  connection  with  the  theory  of 

torsion  oscillations  of  drums,  Mat.  Sb.,  vol.  40,  pp.  455-66  (1933). 
♦  [237]     On  a  new  class  of  hermitian  forms,  Izv.  Akad.  Nauk  SSSR,  Ser.  Fiz.-Mat., 

vol.  9,  pp.  1259-75  (1933). 
*[238]     On  the  nodes  of  harmonic  oscillations  of  mechanical  systems  of  a  special 

type.  Mat.  Sb.,  vol.  41,  pp.  339-48  (1934). 
[239]     Sur  quelques  applications  des  noyaux  de  Kellog  aux  problemes  d'oscilla- 

tions,  Proc.  Charkov  Mat.  Soc.  (4),  vol.  11,  pp.  3-19  (1935). 
[240]     Sur  les  vibrations  propres  des  tiges  dont  I'une  des  extremites  est  encastree 

et  V autre  libre,  Proc.  Charkov.  Mat.  Soc.  (4),  vol.  12,  pp.  3-11  (1935). 
*[241]     Generalization  of  some  results  of  Lyapunov  on  linear  differential  equa- 
tions with  periodic  coefficients,  Dokl.  Akad.  Nauk  SSSR,  vol.  73,  pp.  445-48 

(1950). 
*[242]     On  an  application  of  the  fixed-point  principle  in  the  theory  of  linear 

transformations  of  spaces  with  indefinite  metric,  Uspehi  Mat.  Nauk,  vol.  5,  no.  2, 

pp.  180-90   (1950). 
*[243]     On  an  application  of  an  algebraic  proposition  in  the  theory  of  monodromy 

matrices,  Uspehi  Mat.  Nauk,  vol.  6,  no.  1,  pp.  171-77  (1951). 


262  Bibliography 

*[244]     On  some  problems  concerning  Lyapunov's  ideas  in  the  theory  of  stability, 

Uspehi  Mat.  Nauk,  vol.  3,  no.  3,  pp.  166-69  (1948). 
*[245]     On  the  theory  of  integral  matrix  functions  of  exponential  type,  Ukrain. 

Mat.  Z.,  vol.  3,  pp.  164-73  (1951). 
•[246]     On  some  problems  in  the  theory  of  oscillations  of  Sturm  systems,  Prikl. 

Mat.  Meh.,  vol.  16,  pp.  555-68  (1952). 
*[247]     Krein,  M.  G.  and  M.  A.  Na^Mark   (Neumark),  On  a  transformation  of  the 

Bezoutian  leading  to  Sturm's  theorem,  Proc.  Charkov  Mat.  Soc,   (4),  vol.  10, 

pp.  33-40  (1933). 
*[248]     On  the  application  of  the  Bezoutian  to  problems  of  the  separation  of 

roots  of  algebraic  equations,  Trudy  Odessa  Gos.  Univ.  Mat.,  vol.  1,  pp.  51-69 

(1935). 
[249]     Kronecker,  L.,  Algebraische  deduction  der  Schaaren  bilinearer  Formen,  S.-B. 

Akad.  Berlin  1890,  pp.  763-76. 
[250]     Krull,  W.,  Theorie  und  Anwendung  der  verallgemeinerten  Abelschen  Gruppen, 

S.-B.  Akad.  Heidelberg  1926,  p.  1. 
*[251]     Krylov,  a.  N.,  On  the  numerical  solution  of  the  equation  by  which  the  frequency 

of  small  oscillations  is  determined  in  technical  problems,  Izv.  Akad.  Nauk  SSSR 

Ser.  Fiz.-Mat.,  vol.  4,  pp.  491-539  (1931). 
[252]     Lappo-Danilevskii,  I.  A.,  Resolution  algorithmique  des  problemes  reguliers  de 

Poincare  et  de  Eiemann,  J.  Phys.  Mat.  Soc.  Leningrad,  vols.  2:1,  pp.  94-120; 

121-54  (1928). 
[253]     Theorie  des  matrices  satisfaisantes  a  des  systemes  des  equations  diffe- 

rentielles  lineaires  a  coefficients  rationnels  arbitraires,  J.  Phys.  Mat.  Soc.  Lenin- 
grad, vols.  2:2,  pp.  41-80  (1928). 
*[254]     Fundamental  problems  in  the  theory  of  systems  of  linear  differential 

equations  with  arbitrary  rational  coefficients,  Trans.  First  Math.  Congr.,  ONTI, 

1936,  pp.  254-62. 
[255]     Ledermann,  W.,  Beduction  of  singular  pencils  of  matrices,  Proc.  Edinburgh 

Math.  Soc,  vol.  4,  pp.  92-105  (1935). 
[256]     Bounds  for  the  greatest  latent  root  of  a  positive  matrix,  J.  London  Math. 

Soc,  vol.  25,  pp.  265-68  (1950). 
*[257]     LiDSKii,  V.  B.,  On  the  characteristic  roots  of  a  sum  and  a  product  of  symmetric 

matrices,  Dokl.  Akad.  Nauk  SSSE,  vol.  75,  pp.  769-72  (1950). 
*[258]     Oscillation  theorems  for  canonical  systems  of  differential  equations, 

Dokl.  Akad.  Nauk  SSSR,  vol.  102,  pp.  111-17  (1955). 
[259]     LiENARD,  and  Chipart,  Sur  la  signe  de  la  partie  reelle  des  racines  d'une  equation 

algebrique,  J.  Math.  Pures  Appl.  (6),  vol.  10,  pp.  291-346  (1914). 
*[260]     LiPiN,  N.  v..  On  regular  matrices,  Trans.  Inst.  Eng  8.  Transport,  vol.  9,  p.  105 

(1934). 
*[261]     LivsHiTZ,  M.  S.  and  V.  P.  Potapov,  The  multiplication  theorem  for  characteristic 

matrix  functions,  Dokl.  Akad.  Nauk  SSSR,  vol.  72,  pp.  164-73  (1950). 
*[262]     LoPSHiTZ,  A.  M.,  Vector  solution  of  a  problem  on  doubly  symmetric  matrices. 

Trans.  Math.  Congress  Moscow,  1927,  pp.  186-87. 
*[263]     The  characteristic  equation  of  lowest  degree  for  affinors  and  its  appli- 
cation to  the  integration  of  differential  equations,  Trans.  Sem.  Vectors  and  Ten- 
sors, vols.  2/3  (1935). 
*[264]     A  numerical  method  of  determining  the  characteristic  roots  and  charac- 
teristic planes  of  a  linear  operator.  Trans.  Sem.  Vectors  and  Tensors,  vol.  7, 

pp.  233-59  (1947). 


Bibliography  263 

*[265]     An  extremal  theorem  for  a  hyper -ellipsoid  and  its  application   to   the 

solution  of  a  system  of  linear  algebraic  equations,  Trans.  Sem.  Vectors  and  Ten- 
sors, vol.  9,  pp.  183-97  (1952). 
[266]     LowNER,  K.,  t)ber  monotone  Matrixfunktionen,  Math.  Z.,  vol.  38,  pp.  177-216 

(1933) ;  vol.  41,  pp.  18-42  (1936). 
[267]     Some  classes  of  functions  defined  by  difference  on  differential  inequali- 
ties, Bull.  Amer.  Math.  Soc,  vol.  56,  pp.  308-19  (1950). 
*[268]     LusiN,  N.  N.,  On  Krylov's  method  of  forming  the  secular  equation,  Izv.  Akad. 

Nauk  SSSK,  Ser.  Fiz.-Mat.,  vol.  7,  pp.  903-958  (1931). 
*[269]     On  some  properties  of  the  displacement  factor  in  Krylov's  method,  Izv. 

Akad.  Nauk  SSSR,  Ser.  Fiz.-Mat.,  vol.  8,  pp.  596-638;  735-62;  1065-1102  (1932). 
*[270]     On  the  matrix  theory  of  differential  equations,  Avtomat.  i  Telemeh,  vol.  5, 

pp.  3-66  (1940). 
*[271]     Lyusternik,  L.  A.,  The  determination  of  eigenvalues  of  functions  by  an  electric 

scheme,  Electricestvo,  vol.  11,  pp.  67-8  (1946). 
*[272]     On  electric  models  of  symmetric  matrices,  Uspehi  Mat.  Nauk,  vol.  4, 

no.  2,  pp.  198-200  (1949). 
*[273]     Lyusternik,  L.  A.  and  A.  M.  Prokhorov,  Determination  of  eigenvalues  and 

functions  of  certain  operators  by  means  of  an  electrical  networic,  Dokl.  Akad 

Nauk  SSSR,  vol.  55,  pp.  579-82;   Izv.  Akad.  Nauk  SSSE,  Ser.  Mat.,  vol.  11, 

pp.  141-45  (1947). 
[274]     Marcus,  M.,  A  remarTc  on  a  norm  inequality  for  square  matrices,  Proe.  Amer. 

Math.  Soc,  vol.  6,  pp.  117-19  (1955). 
[275]     An  eigenvalue  inequality  for  the  product   of  normal  matrices,   Amer. 

Math.  Monthly,  vol.  63,  pp.  173-74  (1956). 
[276]     A  determinantal  inequality  of  H.  P.  Robertson,  II,  J.  Washington  Acad. 

Sci.,  vol.  47,  pp.  264-66  (1957). 
[277]     Convex  functions  of  quadratic  forms,  Duke  Math.  J.,  vol.  24,  pp.  321-26 

(1957). 
[278]     Marcus,  M.  and  J.  L.  McGregor,  Extremal  properties  of  Hermitian  matrices, 

Canad.  J.  Math.,  vol.  8,  pp.  524-31  (1956). 
[279]     Marcus,  M.  and  B.  N.  Moyls,  On  the  maximum  principle  of  Ky  Fan,  Canad. 

J.  Math.,  vol.  9,  pp.  313-20  (1957). 
[280]     Maximum  and  minimum  values  for  the  elementary  symmetric  functions 

of  Hermitian  forms,  J.  London  Math.  Soc,  vol.  32,  pp.  374-77  (1957). 
*[281]     May  ANTS,  L.  S.,  A  method  for  the  exact  determination  of  the  roots  of  secular 

equations  of  high  degree  and  a  numerical  analysis  of  their  dependence  on  the 

parameters  of  the  corresponding  matrices,  Dokl.  Akad.  Nauk  SSSR,  vol.  50, 

pp.  121-24  (1945). 
[282]     MiRSKY,  L.,  An  inequality  for  positive-definite  matrices,  Amer.  Math.  Monthly, 

vol.  62,  pp.  428-30  (1955). 
[283]     The  norm  of  ad  jugate  and  inverse  matrices.  Arch.  Math.,  vol.  7,  pp.  276-77 

(1956). 

[284]     The  spread  of  a  matrix,  Mathematika,  vol.  3,  pp.  127-30   (1956). 

[285]     Inequalities  for  normal  and  Hermitian  matrices,  Duke  Math.  J.,  vol.  24, 

pp.  591-99  (1957). 
[286]     MiTROVic,  D.,  Conditions  graphiques  pour  que  toutes  les  racines  d'une  equation 

algebrique  soient  a  parties  reelles  negatives,  C.  R.  Acad.  Sci.  Paris,  vol.  240, 

pp.  1177-79  (1955). 
[287]     MoRGENSTERN,    D.,    Eine    Verschdrfung    der    Ostrowskischen    Determinanten- 

abschdtzung,  Math.  Z.,  vol.  66,  pp.  143-46  (1956). 


264  Bibliography 

[288]     MoTZKiN,  T.  and  O.  Taussky,  Pairs  of  matrices  with  property  L.,  Trans.  Amer. 
Math.  Soc,  vol.  73,  pp.  108-14  (1952)  ;  vol.  80,  pp.  387-401  (1954). 
*[289]     Neigaus  (Neuhaus),  M.  G.  and  V.  B.  Lidskii,  On  the  'boundedness  of  the  solu- 
tions of  linear  systems  of  differential  equations  with  periodic  coefficients,  Dokl. 
Akad.  Nauk  SSSE,  vol.  77,  pp.  183-93   (1951). 

[290]     Neumann,  J.,  Approximative  of  matrices  of  high  order,  Portugal.  Math.,  vol.  3, 
pp.  1-62  (1942). 
*[291]     Nudel'man,  a.  a.  and  P.  A.  Shvartsman,  On  the  spectrum  of  the  product  of 
unitary  matrices,  Uspehi  Mat.  Nauk,  vol.  13,  no.  6,  pp.  111-17  (1958). 

[292]     Okamoto,  M.,  On  a  certain  type  of  matrices  with  an  application  to  experimental 
design,  Osaka  Math.  J.,  vol.  6,  pp.  73-82  (1954). 

[293]     Oppenheim,  A.,  Inequalities  connected  with  definite  Hermitian  forms,  Amer. 
Math.  Monthly,  vol.  61,  pp.  463-66  (1954). 

[294]     Orlando,  L.,  Sul  prohlema  di  Hurwitz  relativo  alle  parti  reali  delle  radici  di  un' 
equatione  algehrica,  Math.  Ann.,  vol.  71,  pp.  233-45   (1911). 

[295]     OsTROWSKi,  A.,  Bounds  for  the  greatest  latent  root  of  a  positive  matrix,  J.  Lon- 
don Math.  Soc,  vol.  27,  pp.  253-56  (1952). 

[296]     Sur  quelques  applications  des  fonctions  convexes  et  concaves  au  sens  de 

I.  Schur,  J.  Math.  Pures  Appl.,  vol.  31,  pp.  253-92  (1952). 

[297]     On  nearly  triangular  matrices,  J.  Kes.  Nat.  Bur.  Standards,  vol.  52, 

pp.  344-45  (1954). 

[298]     On  the  spectrum  of  a  one-parametric  family  of  matrices,  J.  Eeine  Angew. 

Math.,  vol.  193,  pp.  143-60  (1954). 

[299]     Sur  les  determinants  a  diagonale  dominante,  Bui.  Soc.  Math.  Belg.,  vol.  7, 

pp.  46-51  (1955). 

[300]     Note  on  hounds  for  some  determinants,  Duke  Math.  J.,  vol.  22,  pp.  95-102 

(1955). 

[301]     Pher  Normen  von  Matrizen,  Math.  Z.,  vol.  63,  pp.  2-18  (1955). 

[302]     l^her  die  Stetiglceit  von  charaMeristischen  Wurzeln  in  AbhangigTceit  von 

den  Matrizenelementen,  Jber.  Deutsch.  Math.  Verein.,  vol.  60,  pp.  40-42  (1957). 
*[303]     Papkovich,  P.  F.,  On  a  method  of  computing  the  roots  of  a  characteristic  deter- 
minant, Prikl.  Mat.  Meh.,  vol.  1,  pp.  314-18  (1933). 

[304]     Papulis,  a.,  Limits  on  the  zeros  of  a  networTc  determinant,  Quart.  Appl.  Math., 
vol.  15,  pp.  193-94  (1957). 

[305]     Parodi,  M.,  Bemarques  sur  la  staMlite,  C.  R.  Acad.  Sei.  Paris,  vol.  228,  pp.  51-2; 
807-8;  1198-1200  (1949). 

[306]     Sur  une  propriete  des  racines  d'une  equation  qui  intervient  en  mecanique, 

C.  R.  Acad.  Sci.  Paris,  vol.  241,  pp.  1019-21  (1955). 

[307]     Sur  la  localisation  des  valeurs  caracteristiques  des  matrices  darCs  le  plan 

complexe,  C.  R.  Acad.  Sci.  Paris,  vol.  242,  pp.  2617-18  (1956). 

[308]     Peano,  G.,  Integration  par  series  des  equations  differentielles  lineaires.  Math. 
Ann.,  vol.  32,  pp.  450-56  (1888). 

[309]     Penrose,  R.,  A  generalized  inverse  far  matrices,  Proc.  Cambridge  Philos.  Soc, 
vol.  51,  pp.  406-13   (1955). 

[310]     On  best  approximate  solutions  of  linear  matrix  equations,  Proc.  Cam- 
bridge Philos.  Soc,  vol.  52,  pp.  17-19  (1956). 

[311]     Perfect,  H.,  On  matrices  with  positive  elements.  Quart.  J.  Math.,  vol.  2,  pp.  286- 
90  (1951). 

[312]     On  positive  stochastic  matrices  with  real  characteristic  roots,  Proc.  Cam- 
bridge Philos.  Soc,  vol.  48,  pp.  271-76  (1952). 


Bibliography  265 

[313]     Methods  of  constructing   certain   stochastic   matrices,  Duke   Math.  J., 

vol.  20,  pp.  395-404  (1953);  vol.  22,  pp.  305-11   (1955). 
[314]     A  lower  bound  for  the  diagonal  elements  of  a  non-negative  matrix,  J. 

London  Math.  Soc,  vol.  31,  pp.  491-93  (1956). 
[315]     Perron,  O.,  Jacobischer  Eettenbruchalgorithmus,  Math.  Ann.,  vol.  64,  pp.  1-76 

(1907). 

[316]     mer  Matrizen,  Math.  Ann.,  vol.  64,  pp.  248-63  (1907). 

[317]     iyber  Stabilitdt  und  asymptotisches  Verhalten  der  Losungen  eines  Systems 

endlicher  Differenzengleichungen,  J.  Eeine  Angew.  Math.,  vol.  161,  pp.  41-64 

(1929). 
[318]     Phillips,  H.  B.,  Functions  of  matrices,  Amer.  J.  Math.,  vol.  41,  pp.  266-78  (1919). 
'[319]     PoNTRYAGiN,  L.  S.,  Hermitian  operators  in  a  space  with  indefinite  metric,  Izv. 

Akad.  Nauk  SSSR,  Ser.  Mat.,  vol.  8,  pp.  243-80  (1944). 
^[320]     POTAPOV,  V.  P.,  On  holomorphic  matrix  functions  bounded  in  the  unit  circle, 

Dokl.  Akad.  Nauk  SSSE,  vol.  72,  pp.  849-53  (1950). 
[321]     Rasch,  G.,  Zur  Theorie  und  Anwendung  der  ProduTctintegrals,  J.  Reine  Angew. 

Math.,  vol.  171,  pp.  65-119  (19534). 
[322]     Reichardt,  H.,  Einfache  Eerleitung  der  Jordanschen  Normalform,  Wiss.   Z. 

Humboldt-Univ.  Berlin.  Math.-Nat.  Reihe,  vol.  6,  pp.  445-47  (1953/54). 
[323]     Rechtman-Ol'shanskaya,  P.  G.,  On  a  theorem  of  MarTcov,  Uspehi  Mat.  Nauk, 

vol.  12,  no.    3,  pp.  181-87  (1957). 
[324]     Rham,  G.  de,  Sur  un  theoreme  de  Stieltjes  relatif  a  certains  matrices,  Acad. 

Serbe  Sci.  Publ.  Inst.  Math.,  vol.  4,  pp.  133-54  (1952). 
[325]     Richter,  H.,  t^ber  Matrixfunktionen,  Math.  Ann.,  vol.  122,  pp.  16-34  (1950). 
[326]     BemerJcung  zur  Norm  der  Inversen  einer  Matrix,  Arch.  Math.,  vol.  5, 

pp.  447-48  (1954). 
[327]     Zur  Abschatzung  von  Matrizennormen,  Math.  Nachr.,  vol.  18,  pp.  178-87 

(1958). 
[328]     RoMANOVSKii,  V.  I.,  Un  thioreme  sur  les  zeros  des  matrices  non-negatives.  Bull. 

Soc.  Math.  France,  vol.  61,  pp.  213-19  (1933). 
[329]     Becherches  sur  les  chaines  de  MarTcoff,  Acta  Math.,  vol.  66,  pp.  147-251 

(1935). 
[330]     Roth,  W.,  On  the  characteristic  polynomial  of  the  product  of  two  matrices,  Proc. 

Amer.  Math.  Soc,  vol.  5,  pp.  1-3  (1954). 
[331]     On  the  characteristic  polynomial  of  the  product  of  several  matrices,  Proc. 

Amer.  Math.  Soc,  vol.  7,  pp.  578-82  (1956). 
[332]     Roy,  S.,  A  useful  theorem  in  matrix  theory,  Proc.  Amer.  Math.  Soc,  vol.  5, 

pp.  635-38  (1954). 
^[333]     Sakhnovich,  L.  A.,  On  the  limits  of  multiplicative  integrals,  Uspehi  Mat.  Nauk, 

vol.  12  no.  3,  pp.  205-11  (1957). 
^[334]     Sarymsakov,  T.  A.,  On  sequences  of  stochastic  matrices,  Dokl.  Akad.  Nauk,  vol. 

47,  pp.  331-33  (1945). 
[335]     Schneider,  H.,  An  inequality  for  latent  roots  applied  to  determinants  with  domi- 
nant principal  diagonal,  J.  London  Math.  Soc,  vol.  28,  pp.  8-20  (1953). 
[336]     A  pair  of  matrices  with  property  P,  J.  Amer.  Math.  Monthly,  vol.  62, 

pp.  247-49  (1955). 
[337]     A  matrix  problem  concerning  projections,  Proc  Edinburgh  Math.  Soc, 

vol.  10,  pp.  129-30  (1956). 
[338]     The   elementary   divisors,  associated   with    0,  of  a  singular  M-matrix, 

Proc.  Edinburgh  Math.  Soc,  vol.  10,  pp.  108-22  (1956). 


266  Bibliography 

[339]     ScHOENBERG,  J.,  V^ber  variationsvermindernde  Uneare  Transformationen,  Math. 

Z.,  vol.  32,  pp.  321-28  (1930). 
[340]     Zur  Ahzahlung  der  reellen  Wurzeln  algehraischer  Gleichungen,  Math.  Z., 

vol.  38,  p.  546  (1933). 
[341]     ScHOENBERG,  I.  J.,  and  A.  Whitney,  A  theorem  on  polygons  in  n  dimensions 

with  application  to  variation  diminishing   linear  transformations,  Compositio 

Math.,  vol.  9,  pp.  141-60  (1951). 
[342]     ScHUR,,!.,  Pier  die  charaTcteristischen  Wurzeln  einer  linearen  Substitution  mit 

einer  Anwendung  auf  die  theorie  der  Integralgleichungen,  Math.  Ann.,  vol.  66, 

pp.  488-510  (1909). 
*[343]     Semendyaev,  K.  A.,  On  the  determination  of  the  eigenvalues  and  invariant  mani- 
folds of  matrices  by  means  of  iteration,  Prikl.  Matem.  Meh.,  vol.  3,  pp.  193-221 

(1943). 
*[344]     Sevast 'yanov,  B.  A.,  The  theory  of  branching  random  processes,  Uspehi  Mat. 

Nauk,  vol.  6,  no.  6,  pp.  46-99  (1951). 
*[345]     Shiffner,  L.  M.,  The  development  of  the  integral  of  a  system  of  differential 

equations  with  regular  singularities  in  series   of  powers  of  the  elements  of  the 

differential  substitution,  Trudy  Mat.  Inst.  Steklov.  vol.  9,  pp.  235-66   (1935). 

*[346]     On  the  powers  of  matrices.  Mat.  Sb.,  vol.  42,  pp.  385-94  (1935). 

[347]     Shoda,  K.,  t^ber  mit  einer  Matrix  vertauschbare  Matrizen,  Math.  Z.,  vol.  29, 

pp.  696-712  (1929). 
*[348]     Shostak,  p.  Y.,  On  a  criterion  for  the  conditional  definiteness  of  quadratic  form^ 

in  n  linearly  independent  variables  and  on  a  sufficient  condition  for  a  conditional 

extremum  of  a  function  of  n  variables,  Uspehi  Mat.  Nauk,  vol.  8,  no.  4,  pp.  199-206 

(1954). 
*[349]     Shreider,  Y.  A.,  A  solution  of  systems  of  linear  algebraic  equations,  Dokl.  Akad. 

Nauk,  vol.  76,  pp.  651-55  (1950). 
*[350]     Shtaerman    (Steiermann),  I.  Y.,  A  new  method  for  the  solution  of  certain 

algebraic  equations  which  have  application  to  mathematical  physics,  Z.  Mat., 

Kiev,  vol.  1,  pp.  83-89  (1934)  ;  vol.  4,  pp.  9-20  (1934). 
*[351]     Shtaerman  (Steiermann),  I.  Y.  and  N.  I.  Akhieser  (Achieser),  On  the  theory 

of  quadratic  forms,  Izv.  Polyteh.,  Kiev,  vol.  19,  pp.  116-23  (1934). 
*  [352]     Shura-Bura,  M.  E.,  An  estimate  of  error  in  the  numerical  computation  of  inatrices 

of  high  order,  Uspehi  Mat.  Nauk,  vol.  6,  no.  4,  pp.  121-50  (1951). 
*[353]     Shvartsman  (Schwarzmann),  A.  P.,  On  Green's  matrices  of  self-adjoint  differ- 
ential operators,  Proc.  Odessa  Univ.  Matematika,  vol.  3,  pp.  35-77  (1941). 
[354]     SlEGEL,  C.  L.,  Symplectic  Geometry,  Amer.  J.  Math.,  vol.  65,  pp.  1-86  (1943). 
*[355]     Skal  'kin a,  M.  A.,  On  the  preservation  of  asymptotic  stability  on  transition  from 

differential  equations  to  the  corresponding  difference  equations,  Dokl.  Akad. 

Nauk  SSSE,  vol.  104,  pp.  505-8,  (1955). 
*[356]     Smogorzhevskii,  A.  S.,  Sur  les  matrices  unitaires  du  type  de  circulants,  J.  Mat. 

Circle  Akad.  Nauk  Kiev,  vol.  1,  pp.  89-91  (1932). 
*[356a]  Smogorzhevskii,  A.  S.  and  M.  F.  Kravchuk,  On  orthogonal  transformations. 

Zap.  Inst.  Ed.  Kiev,  vol.  2,  pp.  151-56  (1927). 
[357]     Stenzel,  H.,  Pber  die  BarstellbarTceit  einer  Matrix  als  Produkt  von  zwei  symmet- 

rischen  Matrizen,  Math.  Z.,  vol.  15,  pp.  1-25  (1922). 
[358]     Stohr,  a.,  Oszillationstheoreme  fur  die  EigenveJctoren  spezieller  Matrizen,  J. 

Reine  Angew.  Math.,  vol.  185,  pp.  129-43  (1943). 
*[359]     SuLEiMANOVA,  K.  E.,  stochastic  matrices  with  real  characteristic  values,  Dokl. 

Akad.  Nauk  SSSE,  vol.  66,  pp.  343-45  (1949). 


Bibliography  267 

*[360]     On  the  characteristic  values  of  stochastic  matrices,  XJ6.  Zap.  Moscow 

Ped.  Inst.,  Ser.  71,  Math.,  vol.  1,  pp.  167-97  (1953). 
*[361]     SuLTANOV,  R.  M.,  Some  properties  of  matrices  with  elements  in  a  non- commutative 

ring,  Trudy  Mat.  Sectora  Akad.  Nauk  Baku,  vol.  2,  pp.  11-17  (1946). 
*[362]     SusHKEViCH,  A.  K.,  On  matrices  of  a  special  type,  U6.  Zap.  Univ.  Charkov, 

vol.  10,  pp.  1-16  (1937). 
[363]     Sz-Nagy,  B.,  Remark  on  S.  N.  Boy's  paper  *  A  useful  theorem  in  matrix  theory,' 

Proc.  Amer.  Math.  Soc,  vol.  7,  p.  1  (1956). 
[364]     Ta  Li,  Die  Stabilitdtsfrage  bei  Differemengleichungen,  Acta  Math.,  vol.  63, 

pp.  99-141  (1934). 
[365]     Taussky,  O.,  Bounds  for  characteristic  roots  of  matrices,  Duke  Math.  J.,  vol.  15, 

pp.  1043-44  (1948). 
[366]     A  determinantal  inequality  of  H.  P.  Robertson,  I,  J.  Washington  Acad. 

Sci.,  vol.  47,  pp.  263-64  (1957). 
[367]     Commutativity  in  finite  matrices,  Amer.  Math.  Monthly,  vol.  64,  pp.  229- 

35  (1957). 
[368]     ToEPLiTZ,  O.,  Das  algebraische  Analogon  zu  einem  Satz  von  Fejer,  Math.  Z., 

vol.  2,  pp.  187-97  (1918). 
[369]     TuRNBULL,  H.  W.,  On  the  reduction  of  singular  matrix  pencils,  Proc.  Edinburgh 

Math.  Soc,  vol.  4,  pp.  67-76  (1935). 
*[370.]     TuRCHANiNOV,  A.  S.,  On  some  applications  of  matrix  calculus  to  linear  differen- 
tial equations,  U6.  Zap.  Univ.  Odessa,  vol.  1,  pp.  41-48  (1921). 
*[371]     Verzhbitskii,  B.  D.,  Some  problems  in  the  theory  of  series  compounded  from 

several  matrices.  Mat.  Sb.,  vol.  5,  pp.  505-12  (1939). 
*[372]     ViLENKiN,  N.  Y.,  On  an  estimate  for  the  maximal  eigenvalue  of  a  matrix,  U6. 

Zap.  Moscow  Ped.  Inst.,  vol.  108,  pp.  55-57  (1957). 
[373]     ViviER,  M.,  Note  sur  les  structures  unitaires  et  paraunitaires,  C.  R.  Acad.  Sci. 

Paris,  vol.  240,  pp.  1039-41  (1955). 
[374]     VoLTERRA,  v.,  Sui  fondamenti  della  teoria  delle  equazioni  differenziali  li/neari, 

Mem.  Soc.  Ital.  Sci.  (3),  vol.  6,  pp.  1-104  (1887) ;  vol.  12,  pp.  3-68  (1902). 
[375]     Walker,  A.  and  J.  Weston,  Inclusion  theorems  for  the  eigenvalues  of  a  normal 

matrix,  J.  London  Math.  Soc,  vol.  24,  pp.  28-31  (1949). 
[376]     Wayland,  H.,  Expansions  of  determinantal  equations  into  polynomial  form. 

Quart.  Appl.  Math.,  vol.  2,  pp.  277-306  (1945). 
[377]     Weierstrass,  K.,  Zur  theorie  der  bilinear  en  und  quadratischen  Formen,  Monatsh. 

Akad.  Wiss.  Berlin,  1867,  pp.  310-38. 
[378]     Wellstein,  J.,  t^ber  symmetrische,  alternierende  und  orthogonale  Normalformen 

von  Matrizen,  J.  Reine  Angew.  Math.,  vol.  163,  pp.  166-82  (1930). 
[379]     Weyl,  H.,  Inequalities  between  the  two  Icinds  of  eigenvalues  of  a  linear  trans- 
formation, Proc.  Nat.  Acad.  Sci.,  vol.  35,  pp.  408-11  (1949). 
[380]     Weyr,  E.,  Zur  Theorie  der  bilinearen  Formen,  Monatsh.  f.  Math,  und  Physik, 

vol.  1,  pp.  163-236  (1890). 
[381]     Whitney,  A.,  A  reduction  theorem  for  totally  positive  matrices,  J.  Analyse  Math., 

vol.  2,  pp.  88-92  (1952). 
[382]     Wielandt,  H.,  Bin  Einschliessungssatz  filr  chardkteristische  Wurzeln  normaler 

Matrizen,  Arch.  Math.,  vol.  1,  pp.  348-52  (1948/49). 
[383]     Die  Einschliessung   von  Eigenwerten  normaler  Matrizen,  Math.   Ann. 

vol.  121,  pp.  234-41  (1949). 
[384]     Unzerlegbare   nicht-negative   Matrizen,   Math.   Z.,   vol.   52,   pp.   642-48 

(1950). 


268  Bibliography 

[385]     Linear e  Scharen  von  Matrizen  mit  reellen  Eigenwerten,  Math.  Z.,  vol.  53, 

pp.  219-25  (1950). 
[386]     Pairs  of  normal  matrices  with  property  L,  J.  Res.  Nat.  Bur.  Standards, 

vol.  51,  pp.  89-90  (1953). 
[387]     Inclusion  theorems  for  eigenvalues,  Nat.  Bur.  Standards,  Appl.  Math. 

Sei.,  vol.  29,  pp.  75-78  (1953). 
[388]     An  extremum  property  of  sums  of  eigenvalues,  Proc.  Amer.  Math.  Soc, 

vol.  6,  pp.  106-110  (1955). 
[389]     On  eigenvalues  of  sums  of  normal  matrices.  Pacific  J.  Math.,  vol.  5, 

pp.  633-38  (1955). 
[390]     WiNTNER,  A.,  On  criteria  for  linear  stability,  J.  Math.  Mech.,  vol.  6,  pp.  301-9 

(1957). 
[391]     Wong,  Y.,  An  inequality  for  MinTcowsTci  matrices,  Proc.  Amer.  Math.  Soc,  vol.  4, 

pp.  137-41  (1953). 
[392]     On  non-negative  valued  matrices,  Proc.  Nat.  Acad.  Sci.  U.S.A.,  vol.  40, 

pp.  121-24  (1954). 
*[393]     Yaglom,  I.  M.,  Quadratic  and  sTcew-symmetric  'bilinear  forms  in  a  real  symplectic 

space,  Trudy  Sem.  Vect.  Tens.  Anal.  Moscow,  vol.  8,  pp.  364-81  (1950). 
*  [394]     Yakubovich,  V.  A.,  Some  criteria  for  the  reducibility  of  a  system  of  differential 

equations,  Dokl.  Akad.  Nauk  SSSR,  vol.  66,  pp.  577-80  (1949). 
*[395]     Zeitlin  (Tseitlin),  M.  L.,  Application  of  the  matrix  calculus  to  the  synthesis  of 

relay-contact  schemes,  Dokl.  Akad.  Nauk  SSSR,  vol.  86,  pp.  525-28  (1952). 
*[396]     ZiMMERMANN   (Tsimmerman),  G.  K.,  Decomposition  of  the  norm  of  a  matrix 

into  products  of  norms  of  its  rows,  Nau6.  Zap.  Ped.  Inst.  Nikolaevsk,  vol.  4, 

pp.  130-35  (1953). 


I 


INDEX 


f 


INDEX 

[Numbers  in  italics  refer  to  Volume  Two] 


Absolute  concepts,  184 
Addition  of  congruences,  182 
Addition  of  operators,  57 
Adjoint  matrix,  82 
Adjoint  operator,  265 
Algebra,  17 
Algorithm  of  Gauss,  23ff . 

generalized,  45 
Angle  between  vectors,  242 
Axes,  principal,  309 
reduction  to,  309 

Basis  (es),  51 

characteristic,  73 

coordinates  of  vector  in,  53 

Jordan,  201 
lower,  202 

orthonormal,  242,  245 
Bessel,  inequality  of,  259 
Bezout,  generalized  theorem  of,  81 
Binet-Cauchy  formula,  9 
Birkhofe,  G.  D.,  147 
Block,  of  matrix,  41 

diagonal,  isolated,  75 

Jordan,  151 
Block  multiplication  of  matrices,  42 
Bundle  of  vectors,  183 
Bun:^akovskii  's  inequality,  255 

Cartan,  theorem  of,  4 
Cauchy,  formula  of  Binet-,  9 

system  of,  115 
Cauchy  identity,  10 
Cauchy  index,  174,  216 
Cayley,  formulas  of,  279 
Cay  ley-Hamilton  theorem,  83,  197 
Cell,  of  matrix,  41 
Chain,  see  Jordan,  Markov,  Sturm 
Characteristic  basis,  73 
Characteristic  direction,  71 
Characteristic  equation,  70,  310,  338 
Characteristic  matrix,  82 
Characteristic  polynomial,  71,  82 


Characterization  of  root,  minimal,  319 

maximal-minimal,  321,  322 
Chebyshev,  173,  240 

polynomials  of,  259 
Chebyshev-Markov,  formula  of,  248 

theorem  of,  247 
Chetaev,  121 
Chipart,  173,  221 
Coefficients  of  Fourier,  261 
Coefficients  of  influence,  reduced.  111 
Column,  principal,  338 
Column  matrix,  2 
Columns,  Jordan  chains  of,  165 
Components,  of  matrix,  105 

of  operator,  hermitian,  268 
skew-symmetric,  281 

symmetric,  281 
Compound  matrix,  19ff.,  20 
Computation  of  powers  of  matrix,  109 
Congruences,  181,  182 
Constraint,  320 
Convergence,  110,  112 
Coordinates,  transformation  of,  59 

of  vector,  53 
Coordinate  transformation,  matrix  of,  60 

D  'Alembert-Euler,  theorem  of,  286 
Danilevskii,  214 

Decomposition,  of  matrix  into  triangular 
factors,  33ff . 

polar,  of  operator,  276,  286;  e 

of  space,  248 
Defect  of  vector  space,  64 
Derivative,  multiplicative,  133 
Determinant  identity  of  Sylvester,  32,  33 
Determinant  of  square  matrix,  1 
Diagonal  matrix,  3 
Dilatation  of  space,  287 
Dimension,  of  matrix,  1 

of  vector  space,  51 
Direction,  characteristic,  71 
Discriminant  of  form,  333 
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272 


Index 


Divisors,  elementary,  142, 144, 194 

admissible,  238 

geometrical  theory  of,  175 

infinite,  27 
Dmitriev,  87 
Domain  of  stability,  232 
Dynkin,  87 

Eigenvalue,  69 
Elements  of  matrix,  1 
Elimination  method  of  Gauss,  23f  f . 
Equivalence,  of  matrices,  61,  132,  133 

of  pencils,  strict,  24 
Ergodic  theorem  for  Markov  chains,  95 
Erugin,  theorem  of,  122 
Euler-D  'Alembert,  theorem  of,  286 

Factor  space,  183 
Eaddeev,  method  of,  87 
Field,  1 

Forces,  linear  superposition  of,  28 
Form,  bilinear,  294 
Hankel,  338 ;  ^^5 
hermitian,  244,  331 

bilinear,  332 

canonical  form  of,  337 

negative  definite,  337 

negative  semidefinite,  336 

pencil  of,  see  pencil 

positive  definite,  337 

positive  semidefinite,  336 

rank  of,  333 

signature  of,  334 

singular,  333 
quadratic,  246,  294 

definite,  305 

discriminant  of,  294 

rank  of,  296 

real,  294 

reduction  of,  299fe. 

reduction  to  principal  axes,  309 

restricted,  306 

semidefinite,  304 

signature  of,  296,  298 

singular,  294 
Fourier  series,  261 
Frobenius,  304,  339,  343 ;  53 

theorem  of,  343 ;  53 
Function,  entire,  1Q9 
left  value  of,  81 

Gantmacher,  103 
Gauss,  algorithm  of,  23ff. 
generalized,  45 
elimination  method  of,  23 £f. 


Gaussian  form  of  matrix,  39 
Golubchikov,  124 
Governors,  172,  233 
Gram,  criterion  of,  247 
Gramian,  247,  251 
Group,  18 

unitary,  268 
Gundenfinger,  304 

Hadamard  inequality,  252 

generalized,  254 
Hamilton-Cayley  theorem,  83,  197 
Hankel  form,  338;  205 
Hankel  matrix,  338;  205 
Hermite,  172,  202,  210 
Hermite-Biehler  theorem,  228 
Hurwitz,  173,  190,  210 
Hurwitz  matrix,  190 
Hyperlogarithm,  169 

Identity  operator,  66 
Imprimitivity,  index  of,  80 
Ince,  147 

Inertia,  law  of,  297,  334 
Integral,  multiplicative,  132,  138 

product,  132 
Invariant  plane,  of  operator,  283 

Jacobi,  formula  of,  302,  336 

identity  of,  114 

method  of,  300 

theorem  of,  303 
Jacobi  matrix,  99 
Jordan  basis,  201 
Jordan  block,  151 
Jordan  chains  of  columns,  165 
Jordan  form  of  matrix,  152,  201,  202 
Jordan  matrix,  152,  201 

Karpelevich,  87 
Kernel  of  ^.-matrix,  39 
Kolmogorov,  83,  87,  92 
Kotelyanskii,  103 

lemma  of,  71 
Krein,  221,  250 
Kronecker,  75 ;  25,  37,  40 
Krylov,  203 

transformation  of,  206 

Lagrange,  method  of,  299 
Lagrange  interpolation  polynomial,  101 
Lagrange-Sylvester  interpolation  polyno- 
mial, 97 
?v-matrix,  130 
kernel  of,  39 
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Lappo-Danilevskii,  168, 170,  171 

Left  value,  81 

Legendre  polynomials,  258 

Lienard,  173,  221 

Lienard-Chipart  stability  criterion,  221 

Limit  of  sequence  of  matrices,  33 

Linear  (in) dependence  of  vectors,  51 

Linear  transformation,  3 

Logarithm  of  matrix,  239 

Lyapunov,  173,  185 

criterion  of,  120 

equivalence  in  the  sense  of,  118 

theorem  of,  187 
Lyapunov  matrix,  117 
Lyapunov  transformation,  117 

MacMillan,  115 
Mapping,  afiine,  245 
Markov,  173,  240 

theorem  of,  242 
Markov  chain,  acyclic,  88 

cyclic,  88 

fully  regular,  88 

homogeneous,  83 
period  of,  96 

(ir)  reducible,  88 

regular,  88 
Markov  parameters,  233,  234 
Matricant,  127 
Matrices,  addition  of,  4 
group  property,  18 

annihilating  polynomial  of,  89 

applications   to    differential   equations, 
116ff. 

congruence  of,  296 

difference  of,  5 

equivalence  of,  132,  133 

equivalent,  61ff. 

left-equivalence  of,  132,  133 

limit  of  sequence  of,  33 

multiplication  on  left  by  H,  14 

product  of,  6 

quotient  of,  17 

rank  of  product,  12 

similarity  of,  67 

unitary  similarity  of,  242 

w^ith  same  real  part  of  spectrum,  122 

adjoint,  82,  266 
reduced,  90 

blocks  of,  41 

canonical  form  of,  63, 135,  136, 139, 141, 
152,  192,  201,  202,  264,  265 

cells  of,  41 

characteristic,  82 

characteristic  polynomial  of,  82 


Matrix,  column,  2 
commuting,  7 
companion,  149 
completely  reducible,  81 
complex,  Iff. 

orthogonal,  normal  form  of,  23 

representation  of  as  product,  6 

skew-symmetric,  normal  form  of,  18 

symmetric,  normal  form  of,  11 
components  of,  105 
compound,  19ff.,  20 
computation  of  power  of,  109 
constituent,  105 

of  coordinate  transformation,  60 
cyclic  form  of,  54 
decomposition  into  triangular  factors, 

33ff. 
derivative  of,  117 
determinant  of,  1,  5 
diagonal,  3 

multiplication  by,  8 
diagonal  form  of,  152 
dimension  of,  1 
elementary,  132 

elementary  divisors  of,  142,  144,  194 
elements  of,  1 
function  of,  95ff. 

defined  on  spectrum,  96 
fundamental,  73 
Gaussian  form  of,  39 
Hankel,  338 ;  205 

projective,  20 
Hurwitz,  190 
idempotent,  226 
infinite,  rank  of,  239 
integral,  126;  113 

normalized,  114 
invariant  polynomials  of,  139,  144,  194 
inverse  of,  15 

minors  of,  19ff. 
irreducible,  50 

(im)  primitive,  80 
Jacobi,  99 

Jordan  form  of,  152,  201,  202 
I,  130 

and  linear  operator,  56 
logarithm  of,  239 
Lyapunov,  117 
minimal  polynomial  of,  89 

uniqueness  of,  90 
minor  of,  2 

principal,  2 
multiplication  of,  by  number,  5 

by  matrix,  17 
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Matrix,  nilpotent,  226 
non-negative,  50 

totally,  98 
non-singular,  15 
normal,  269 

normal  form  of,  150,  192,  201,  202 
notation  for,  1 
order  of,  1 
orthogonal,  263 
oscillatory,  103 
partitioned,  41,  42 
permutable,  7 
permutation  of,  50 
polynomial,  see  polynomial  matrix 
polynomials  in,  permutability  of,  13 
positive,  50 

spectra  of,  53 

totally,  98 
power  of,  12 

computation  of,  109 
power  series  in,  113 
principal  minor  of,  2 
quasi-triangular,  43 
rank  of,  2 
reducible,  50,  51 

normal  form  of,  75 
representation  as  product,  264 
root  of  non-singular,  233 
root  of  singular,  234fE.,  239 
Eouth,  191 
row,  2 

of  simple  structure,  73 
singular,  15 
skew-symmetric,  19 
square,  1 

square  root  of,  239 
stochastic,  83 

fully  regular,  88 

regular,  88 
spur  of,  87 
subdiagonal  of,  13 
superdiagonal  of,  13 
symmetric,  19 
trace  of,  87 

transformation  of  coordinate,  60 
transforming,  35,  60 
transpose  of,  19 
triangular,  18,  218;  155 
unit,  12 

unitary,  263,  269 

unitary,  representation  of  as  product,  5 
upper  quasi-triangular,  43 
upper  triangular,  18 
Matrix  addition,  properties  of,  4 


Matrix  equations,  215ff. 

uniqueness  of  solution,  16 
Matrix  multiplication,  6,  7 
Matrix  polynomials,  76 

left  quotient  of,  78 

multiplication  of,  77 
Maxwell,  172 

Mean,  convergence  in,  of  series,  260 
Metric,  242 

euclidean,  245 

hermitian,  243,  244 
positive  definite,  243 
positive  semidefinite,  243 
Minimal  indices  for  columns,  38 
Minor,  2 

almost  principal,  102 

of  zero  density,  104 
Modulus,  left,  275 
Moments,  problem  of,  236,  237 
Motion,  of  mechanical  system,  125 

of  point,  121 

stability  of,  125 
asymptotic,  125 

Naimark,  221,  233,  250 
Nilpotency,  index  of,  226 
Norm,  left,  275 
of  vector,  243 
Null  vector,  52 
Nullity  of  vector  space,  64 
Number  space,  ?i-dimensional,  52 

Operations,  elementary,  134 
Operator  (linear),  55,  66 
adjoint,  265 
decomposition  of,  281 
hermitian,  268 

positive  definite,  274 

positive  semidefinite,  274 

projective,  20 

spectrum  of,  272 
identity,  QQ 
invariant  plane  of,  283 
matrix  corresponding  to,  56 
normal,  268 
positive   definite,  280 
positive  semidefinite,  280 
normal,  280 
orthogonal,  of  first  kind,  281 

(im) proper,  281 

of  second  kind,  281 
polar  decomposition  of,  276,  286 
real,  282 
semidefinite,  274,  280 


I 


Index 


275 


operator  (linear),  of  simple  structure,  72 

skew-symmetric,  280 

square  root  of,  275 

symmetric,  280 

transposed,  280 

unitary,  268 

spectrum  of,  273 
Operators,  addition  of,  57 

multiplictaion  of,  58 
Order  of  matrix,  1 
Orlando,  formula  of,  196 
Orthogonal  complement,  266 
Orthogonalization,  256 
Oscillations,  small,  of  system,  326 

Parameters,  homogeneous,  26 

Markov,  S33,  234 
Parseval,  equality  of,  261 
Peano,  127 
Pencil  of  hermitian  forms,  338 

characteristic  equation  of,  338 

characteristic  values  of ,  338 

principal  vector  of,  338 
Pencil (s)  of  matrices,  canonical  form  of, 
37,  39 

congruent,  41 

elementary  divisors  of,  infinite,  27 

rank  of,  29 

regular,  25 

singular,  25 

strict  equivalence  of,  24 
Pencil  of  quadratic  forms,  310 

characteristic  equation  of,  310 

characteristic  value  of,  310 

principal  column  of,  310 

principal  matrix  of,  312 

principal  vector  of,  310 
Period,  of  Markov  chain,  96 
Permutation  of  matrix,  50 
Perron,  53 

formula  of,  116 
Petrovskii,  113 

Polynomial (s),  annihilating,  176,  177 
minimal,  176 
of  square  matrix,  89 

of  Chebyshev,  259 

characteristic,  71 

interpolation,  97,  101,  103 

invariant,  139,  144,  194 

of  Legendre,  258 

matrix,  see  matrix  polynomials 

minimal,  89,  176,  177 

monic,  176 

scalar,  76 

positive  pair  of,  227 


Polynomial  matrix,  76,  130 

elementary  operations  on,  130,  131 
regular,  76 
order  of,  76 
Power  of  matrix,  12 
Probability,  absolute,  93 
limiting,  94 
mean  limiting,  96 
transition,  82 
final,  88 
limiting,  88 
mean  limiting,  96 
Product,  inner,  of  vectors,  243 
scalar,  of  vectors,  242,  243 
of  operators,  58 
of  sequences,  6 
Pythagoras,  theorem  of,  244 

QUASI-ERGODIC   THEOREM,   95 

Quasi-triangular  matrix,  43 
Quotients  of  matrices,  17 

Eank,  of  infinite  matrix,  239 

of  matrix,  2 

of  pencil,  29 

of  vector  space,  64 
Eelative  concepts,  184 
Eight  value,  81 
Eing,  17 
Eomanovskii,  83 
Eoot  of  matrix,  233,  234ff.,  239 
Eotation  of  space,  287 
Eouth,  173,  201 

criterion  of,  180 
Eouth-Hurwitz,  criterion  of,  194 
Eouth  matrix,  191 
Eouth  scheme,  179 
Eow  matrix,  2 

SCHLESINGER,  133 

Schur,  formulas  of,  46 
Schwarz,  inequality  of,  255 
Sequence  of  vectors,  256,  260 
Series,  convergence  of,  260 

fundamental,  of  solutions,  38 
Signature  of  quadratic  form,  296,  298 
Similarity  of  matrices,  67 
Singularity,  143 
Smirnov,  171 
Space,  coefficient,  232 

decomposition  of,  177,  248 

dilatation  of,  287 

euclidean,  242,  245 

extension  of,  to  unitary  space,  282 

factor,  183 


276 


Index 


Space,  rotation  of,  287 

unitary,  242,  243 

as  extension  of  euclidean,  282 
Spectrum,  96,  272,  273 ;  53 
Spur,  87 
Square  (s),  independent,  297 

positive,  334 
Stability,  criterion  of,  221 

domain  of,  232 

of  motion,  125 

of  solnution  of  linear  system,  129 
States,  essential,  92 

limiting,  92 

non-essential,  92 
Stieltjes,  theorem  of,  232 
Stodola,  173 
Sturm,  theorem  of,  175 
Sturm  chain,  175 

generalized,  176 
Subdiagonal,  13 
Subspace,  characteristic,  71 

coordinate,  51 

cyclic,  185 

generated  by  vector,  185 

invariant,  178 

vector,  63 
Substitution,  integral,  143, 169 
Suleimanova,  87 
Superdiagonal,  13 
Sylvester,  identity  of,  32,  33 

inequality  of,  66 
Systems  of  differential  equations,  applica- 
tion of  matrices  to,  116ff. 

equivalent,  118 

reducible,  118 

regular,  121, 168 

singularity  of,  143 

stability  of  solution,  129 
Systems  of  vectors,  bi-orthogonal,  267 

orthonormal,  245 

Trace,  87 
Transformation,  linear,  3 

of  coordinates,  59 

orthogonal,  242,  263 

unitary,  242,  263 

written  as  matrix  equation,  7 
Lyapunov,  117 
Transforming  matrix,  35,  60 
Transpose,  19,  280 
Transposition,  18 


Unit  sum  of  squares,  314 
Unit  sphere,  315 
Unit  vector,  244 

Value (s),  characteristic,  maximal,  53 
extremal  properties  of,  317 

latent,  69 

left  and  right,  of  function,  81 

proper,  69 
Vector (s),  51 

angle  between,  242 

bundle  of,  183 

Jordan  chain  of,  202 

complex,  282 

congruence  of,  181 

extremal,  55 

inner  product  of,  243 

Jordan  chain  of,  201 

latent,  69 

length  of,  242,  243 

linear  dependence  of,  51 
test  for,  251 
modulo  I,  183 

linear  independence  of,  51 

norm  of,  243 

normalized,  244;  66 

null,  52 

orthogonal,  244,  248 

orthogonalization  of  sequence,  256 

principal,  318,  338 

proper,  69 

projecting,  248 

projection  of,  orthogonal,  248 

real,  282 

scalar  product  of,  242,  243 

systems  of,  bi-orthogonal,  267 
orthonormal,  245 

unit,  244 
Vector  space,  50ff.,  51 

basis  of,  51 

defect  of,  64 

dimension  of,  51 

finite-dimensional,  51 

infinite-dimensional,  51 

nullity  of,  64 

rank  of,  64 
Vector,  subspace,  63 
Volterra,  133, 145, 147 
Vyshnegradskii,  172 

Weierstrass,  25 


4^SbWH  S^ICmT'DDR/M:^-' 


Due 


Date    Due, 
Retywied  Due 


Returned 


FEB  1  1  20fP  t  1  2013 


H«sR^Gtf^TEncrtiBR^Rf~ 


3  1262  00085  4470 


"Sb/, 


^. 


'% 


( 


^M 


